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ABSTRACT 

Each  year,  the  United  States  Army  recruits  and  trains  thousands  of  new  soldiers  to 
fill  vacancies  in  Army  organizations  created  by  promotion,  transfer,  or  termination  of 
service.  Installations  responsible  for  training  new  recruits  are  scattered  across  the  United 
States.  Initial  entry  training  for  new  recruits  is  conducted  in  two  phases:  Basic  Combat 
Training  followed  by  Advanced  Individual  Training. 

Proper  management  of  the  Army's  initial  entry  training  program  is  a  very 
complex,  practical  military  logistics  problem  that  demands  timely  scheduling  of  a  broad 
range  of  reusable  training  resources,  such  as,  training  companies.  Currently,  manual 
heuristic  methods  are  used  to  schedule  training  companies  throughout  the  planning 
horizon  to  support  initial  entry  training,  where  training  company  scheduling  also  involves 
deciding  how  many  recruits  to  assign  to  training  companies  each  week.  These  methods 
have  evolved  over  a  number  of  years  when  there  were  few  changes  to  the  training  base, 
and  recruiting  levels  remained  relatively  stationary.  Unfortunately,  there  are  several 
severe  shortcomings  with  these  methods.  For  example,  determining  the  number  of 
recruits  assigned  per  training  company  and  the  number  of  weeks  a  training  company 
remains  busy  training  recruits  is  a  manual  trial-and-error  process.  Second,  it  is  possible 
for  different  analysts  to  generate  different  solutions  for  the  same  recruitment  scenario. 
Third,  no  methods  exist  for  conducting  comparative  analyses  to  appraise  the  quality  of 
competing  feasible  training  schedules.  Finally,  the  temporal  interdependence  of  decisions 
makes  decision  variables  in  the  future  periods  depend  on  current  decision  variables.  This 
complicates  resource  scheduling  and  makes  the  manual  generation  of  week-by-week 
training  schedules  a  tedious,  time-consuming  task. 

This  dissertation:  (1)  formulates  a  mathematical  dynamic  model  of  the  Basic 
Combat  Training  phase  of  initial  entry  training;  (2)  formulates  a  decision  model  for 
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optimally  scheduling  training  resources  based  on  dynamic  programming;  (3)  formulates 
an  improved  heuristic  procedure  for  scheduling  training  resources;  (4)  incorporates  a 
"training  quality"  performance  measure  into  the  formulation  of  the  objective  function 
making  it  possible  to  compare  competing  feasible  training  schedules  obtained  by  various 
methods;  and  (5)  designs,  develops  and  implements  a  fully  operational  computer-based 
decision  support  system  (DSS)  for  scheduling  basic  training  resources. 

The  computational  experiments  reveal  that  the  heuristic  procedures  developed  are 
indeed  computationally  efficient  and  provide  "good"  solutions  in  terms  of  training 
"quality,"  resource  utilization,  and  training  cost. 
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1.  INTRODUCTION 

Each  year,  the  United  States  Army  recruits  and  trains  thousands  of  new  soldiers  to 
fill  vacancies  in  Army  organizations  created  by  promotion,  transfer,  or  termination  of 
service.  Responsibility  for  initial  entry  training  belongs  to  the  U.S.  Army  Training  and 
Doctrine  Command  (TRADOC)  headquartered  at  Fort  Monroe,  Virginia.  The 
installations  responsible  for  training  new  recruits  are  scattered  across  the  United  States  as 
shown  in  Figure  1. 


Figure  1.  Initial  Entry  Army  Training  Installations  (1988) 


As  new  recruits  arrive  at  initial  entry  training  installations,  the}'  are  temporarily 
assigned  to  a  reception  battalion  where  they  are  in-processed  before  beginning  their  initial 
entry  training.  Ideally,  the  amount  of  time  recruits  are  assigned  to  the  reception  battalion 
does  not  exceed  one  week. 

Entry  level  training  consists  of  two  phases:  Basic  Combat  Training  (BCT), 
nomially  lasting  eight  weeks,  followed  by  Advanced  Individual  Training  (AIT)  which 
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varies  from  five  to  fifty  weeks.  The  variability  in  time  of  AIT  reflects  curriculum 
differences  across  the  AIT  programs  administered  at  different  installations.  In  One- 
Station-Unit  Training  (OSUT),  BCT  and  AIT  take  place  at  the  same  installation.  Figure 
2  illustrates  an  aggregated  view  of  the  initial  entry  training  process  where,  for  each 
training  phase,  the  training  companies  from  all  installations  can  be  ^•iewed  as  a  single 
group  of  reusable  resources,  and  an  important  aspect  of  scheduling  is  detemiining  how 
many  recruits  to  assign  to  training  companies  each  week. 


Figure  2.  Aggregated  View  of  the  Initial  Entry  Training  Process 


Figure  3  breaks  down,  or  disaggregates,  the  training  process  by  training  phase 
and  training  installation.  The  arcs  connecting  BCT  and  AIT  training  installations  in 
Figure  3  represent  the  flow  of  recruits  who  complete  Basic  Combat  Training  and  continue 
on  with  Advanced  Individual  Training. 

Proper  management  of  the  military's  initial  entry  training  program  is  a  very 
complex  task  that  requires  timely  scheduling  of  a  broad  range  of  reusable  training 
resources  needed  for  training  new  recruits.  Modeling  the  initial  entry  training  process  is 
complicated  by  the  combinatorial  complexity  of  (1)  the  decision  space  and  (2)  the  state 
space  needed  for  computing  optimal  scheduling  decisions  (see  Section  3.1). 
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Figure  3.  Disaggregated  View  of  the  Initial  Entry  Training  Process  (1988) 
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In  practice,  reusable  training  resources  are  being  eontinuously  rescheduled  to 
support  training  over  a  rolling  planning  horizon.  Although  numerous  types  of  reusable 
training  resources  may  be  scheduled  to  support  initial  entry  training,  the  scheduling 
problem  presented  here  focuses  on  scheduling  only  one  resource  for  one  training  phase; 
namely,  scheduling  basic  training  companies  for  the  aggregated  Basic  Combat  Training 
phase. 

Present  methods  for  scheduling  training  companies  to  support  Basic  Combat 
Training  rely  on  heuristics  that  have  evolved  over  a  number  of  years  when  recruiting 
targets  have  remained  relatively  stable  and  there  have  been  only  a  few  changes  to  the 
training  installation  complex.  Unfortimately,  severe  shortcomings  exist  with  these 
scheduling  methods,  henceforth  referred  to  as  heuristics-used-in-pracrice  (HUIP).  For 
example,  determining  the  number  of  recruits  assigned  per  training  company  {company 
strength)  and  the  number  of  weeks  the  training  companies  remain  busy  training  a 
particular  group  of  new  recruits  {training  cycle  length)  is  essentially  a  manual  trial-and- 
error  process.  Second,  it  is  possible  to  generate  different  schedules  for  allocating  training 
resources  for  the  same  recruitment  (i.e.,  training)  scenario.  Third,  no  methods  exist  for 
making  comparative  analyses  to  appraise  the  quality  of  competing  feasible  schedules  for 
training  resource  allocations.  Finally,  the  interdependence  of  the  problem's  decision 
variables  (e.g.,  company  strengths  and  training  cycle  lengths)  causes  decisions  made  for 
the  current  period  to  impact  future  decision  epochs  (see  Section  3.3,  State  Augmentation, 
for  details).  This  complicates  the  decision  process  and  makes  generating  week-by-week 
training  schedules  a  tedious,  time-consuming  task. 

The  summary  of  the  dissertation  is  as  follows. 

Chapter  2  presents  mathematical  notation  and  a  mathematical  model  of  the  Basic 
Combat  Training  phase  of  initial  entry  training;  henceforth  referred  to  as  the  basic 
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training  problem.  Salient  dynamics  of  Basic  Combat  Training  essential  to  the 
development  of  the  mathematical  model  of  the  basic  training  problem  are  discussed. 

Chapter  3  formulates  an  optimal  decision  model  for  scheduling  training  resources 
based  on  dynamic  programming  [3].  The  value  of  any  training  schedule  may  be 
measured  by  numerous  performance  measures,  such  as,  training  qualit\',  training  costs,  or 
the  lengtli  of  time  required  to  execute  the  training  program,  to  name  a  few.  For  this 
study,  the  basic  training  problem  is  formulated  using  a  "training  quality"  performance 
measure  called  the  instructor-to-student  ratio  (see  Section  2.1).  An  alternate  objective 
that  minimizes  basic  training  costs  is  also  presented. 

Chapter  4  develops  an  efficient  heuristic  scheduling  method  for  the  basic  training 
problem  that  generates  "good"'  training  resource  schedules  in  a  timely  manner.  The 
heuristic  decision  model  is  motivated  by  the  policy  iteration  step  of  dynamic 
programming's  policy  improvement  algorithm.  The  instructor-to-student  ratio  is  also 
used  in  the  heuristic  decision  model  to  discriminate  among  competing  feasible  training 
resource  schedules,  thereby  correcting  one  of  the  shortfalls  of  the  heuristics-used-in- 
practice  (HUIP)  described  above. 

Chapter  5  discusses  the  development  and  implementation  of  a  fully  operational 
computer-based  decision  support  system  (DSS)  capable  of  supporting  analyses  of  a  broad 
range  of  problems  related  to  initial  entry  training  that  features  fully  automated  heuristic 
scheduling  procedures.  Automation  of  the  heuristic  scheduling  process  corrects  another 
one  of  the  major  shortcomings  of  scheduling  with  heuristics-used-in-practice. 

Chapter  6  gives  results  that  compare  the  quality  of  heuristic  scheduling  methods 
for  various  performance  measures  using  a  set  of  twelve  real-world  test  scenarios.  Section 
6.2  discusses  implementation  of  the  DP  algorithm  and  gives  results  comparing  the  quality 
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of  optimal  versus  heuristic  scheduling  methods  for  one  small  problem  using  a  different 
set  of  twelve  test  scenarios. 

Chapter  7  summarizes  the  work,  discusses  the  contribution  of  the  dissertation  and 
presents  some  concluding  remarks  that  include  suggestions  for  future  research. 
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2.  MODEL  FORMULATION 

Before  presenting  the  mathematieal  model  of  the  basic  training  problem,  practical 
aspects  of  Basic  Combat  Training  essential  to  the  proper  development  of  the  dynamic 
training  system  model  are  discussed. 

2.1  Dynamics  of  the  Basic  Training  Problem 

Estimating  the  Weekly  Arrival  of  New  Recruits 

In  the  basic  training  problem,  demand  for  training  resources  depends  upon  the 
number  of  new  recruits  who  report  for  training  each  week.  In  the  real-world  problem,  the 
arrival  of  recruits  is  a  random  parameter  which  makes  the  demand  for  training  resources  a 
random  variable.  Under  these  circumstances,  demand  would  be  specified  by  a  probability 
distribution.  However,  for  the  version  of  the  basic  training  problem  presented  here, 
recruit  arrivals  are  estimated  ahead  of  time  for  each  week  t  and  year  j  of  the  planning 
horizon  Tj,  given  an  annual  recruiting  target  for  each  year  j.  Therefore,  the  absence  of  a 

random  disturbance  makes  this  formulation  of  the  basic  training  problem  completely 
deterministic  (see  Demand  of  Section  3.2  for  additional  details). 

Recruiting  continues  throughout  the  year  and  focuses  heavily  on  young  people  in 
their  final  year  of  high  school.  However,  high  school  seniors  cannot  be  scheduled  for 
Basic  Combat  Training  until  after  graduation  which  normally  occurs  in  late  spring  or 
early  summer.  This  causes  recruit  arrivals  to  surge  during  summer  months  {surge  period) 
with  fewer  arrivals  during  the  rest  of  the  year  {nonsurge  period).  Surge  periods  also 
occur  around  Thanksgiving  and  Christmas  due  to  a  break  in  training  during  the  holiday 
period  (see  Figure  4). 
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Historical  recruit  training  data  for  this  study  was  provided  by  the  U.S.  Army 
Training  and  Doctrine  Command  Headquarters  1  in  the  form  of  monthly  summary  reports 
for  the  years  1984  through  1987.  Figure  4  shows  the  historical  percent  of  annual  recruit 
arrivals  by  month. 


Figure  4.  Percent  of  Annual  Recruit  Arrivals  by  Month 


The  monthly  data  from  Figure  4  has  been  interpolated  into  a  weekly  distribution  of 
recruit  arrivals  over  the  twelve  months  of  the  year.  The  week-by-week  arrival  of  recruits 
is  estimated  by  multiplying  the  annual  recruiting  target,  times  the  weekly  relative 
frequency  distribution  of  arrivals,  times  the  rate  at  which  new  recruits  report  for  Basic 
Combat  Training  (show  rate).  Analysis  of  the  historical  data  reveals  that  although  annual 
recruiting  targets  vary  from  year-to-year,  the  distribution  of  recruit  arrivals  across  the 
year  remains  relatively  stationary.  As  a  practical  matter,  we  recommend  that  the 

1  Courtesy  of  then  Lieutenant  Colonel  David  Hardin  of  the  Planning,  Programming  ,  .Analysis  and 
Evaluation  Directorate  of  Fort  Monroe,  VA. 
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distribution  of  recruit  arrivals  be  updated  at  the  beginning  of  each  year  using  recruit 
arrival  data  from  the  previous  year.  This  is  done  so  that  the  "current"  frequency 
distribution  of  recruit  arrivals  reflects  potential  changes  in  recruiting  trends  that  may 
affect  the  training  requirement. 

Dynamics  of  Varying  Training  Company  Strength 

Company  strength,  the  number  of  recruits  assigned  per  training  company,  is 
bounded  below  at  150  and  above  at  250  recruits.  In  practice,  companies  scheduled  to 
start  training  in  the  same  week  are  initially  assigned  equal  strengths.  This  rule  simplifies 
logistical  problems  associated  with  managing  the  Basic  Combat  Training  program  and  is 
incorporated  into  our  basic  training  model.  Determining  the  company  strength  for  a 
given  week  t  requires  the  following  information: 

1 .  the  number  of  recruits  that  report  for  training  in  week  t;  and 

2.  the  number  of  training  companies  available  at  the  beginning  of  week  t  to  start 
training  new  recruits. 

However,  the  number  of  training  companies  available  to  start  training  in  week  t 
depends  upon  past  company  strength  decisions  (see  Section  3.3,  State  Augmentation). 
For  example,  if  company  strength  decisions  made  prior  to  week  t  resulted  in  company 
strengths  near  the  lower  bound,  then  more  companies  started  training  in  those  weeks  than 
would  have  started  if  company  strengths  had  been  nearer  the  upper  bound.  This  leaves 
fewer  companies  available  to  start  training  recruits  in  week  t.  In  an^'  week,  it  is  possible 
for  previous  weeks'  company  strength  decisions  to  result  in  a  training  company  shortfall, 
where  the  number  of  training  companies  is  not  sufficient  to  handle  the  arrival  of  new 
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recruits  (or  training  load).  In  such  cases,  the  company  strength  decisions  for  previous 
weeks  must  be  sequentially  revised  (if  possible)  to  correct  the  training  company  shortfall. 

Instructor-to-Student  Ratio 

Basic  training  is,  by  design,  highly  stressful  for  new  recruits;  an  aspect  of  training 
that  often  has  a  negative  effect  on  recruit  learning  and  retention.  Training  managers  rely 
on  quality  instruction  and  close  supervision  of  recruits  to  offset  some  effects  of  stress.  In 
practice,  training  managers  use  the  ratio  of  instructors-to-students  as  a  guide  when 
manually  sizing  training  companies  with  the  heuristics-used-in-practice.  Training 
managers  attempt  to  keep  the  instructor-to-student  ratio  around  l-to-16  (or  lower  if 
possible)  resulting  in  company  strengths  of  approximately  200  recruits  per  company. 

The  instructor-to-student  ratio  serves  as  the  primary  objective  in  the  formulation 
of  the  optimal  decision  model  presented  in  Chapter  3.  The  automated  heuristic 
scheduling  methods  developed  in  Chapter  4  also  make  use  of  the  instructor-to-student 
ratio,  both,  as  a  rule  for  establishing  company  strengths,  and  as  a  performance  measure 
for  comparing  suboptimal  solutions. 

Compressing-the-Load 

Each  week,  recruits  who  report  to  basic  training  installations  are  assigned  to 
training  companies.  This  fill  wee^runs  from  Saturday  through  midnight  Thursday.  Basic 
Combat  Training  begins  on  Friday  and,  in  general,  lasts  eight  weeks.  Recruits  who 
graduate  from  BCT  continue  with  Advanced  Individual  Training.  Normally,  a 
maintenance  week  is  scheduled  following  BCT  to  repair  equipment  and  facilities  before 
the  next  training  cycle  begins.  This  ten-week  sequence  is  called  a  normal  training  cycle. 

In  some  cases,  it  may  not  be  possible  to  eliminate  a  training  company  shortfall  by 
adjusting  company  strengths  alone  (see  above).  Another  way  to  correct  a  training 
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company  shortfall  is  to  make  more  companies  available  in  week  t  by  shortening  the 
training  cycles  for  companies  that  started  either  eight  or  nine  weeks  earlier.  This  is  done 
by  eliminating  either  the  fill  week  or  the  maintenance  week,  or  both,  from  those 
company's  (normal)  training  cycles.  However,  this  practice,  called  compressing-the-load, 
can  have  a  negative  impact  on  training  company  cadre.  Eight  weeks  of  basic  training  are, 
in  many  ways,  as  stressful  for  instructors  as  for  recruits.  Cadre  typically  spend  15  hours 
per  day  training  recruits  which  leaves  little  time  for  personal  business.  Reducing  or 
eliminating  the  break  between  training  cycles  can  decrease  cadre  effectiveness  which 
works  against  the  goal  of  quality  instruction.  Therefore,  compressing-the-load  is  only 
used  when  absolutely  necessary  (e.g.,  when  the  demand  for  training  companies  cannot  be 
met  by  adjusting  company  strengths). 

Backlogging  Recruits 

Occasionally,  the  number  of  recruits  who  report  for  basic  training  in  a  given  week 
exceeds  the  number  of  training  spaces  available,  meaning  there  are  not  enough  training 
companies  available  to  enable  all  recruits  to  begin  basic  training  that  week  (i.e.,  a  training 
company  shortfall  occurs).  In  those  instances,  some  recruits  must  be  held  back  until  the 
next  week  to  start  training.  However,  training  experts  observe  that  recruits  who  are  held 
back,  or  backlogged,  tend  to  have  motivational  or  behavioral  problems  leading  to  poor 
performance,  or  worse,  causing  them  to  fail  basic  training.  As  a  result,  training  managers 
avoid  backlogging  recruits  whenever  possible  and  it  is  not  included  in  model  formulation 
at  this  time. 
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2.2  Related  Work 

A  thorough  literature  survey  (see  Appendix  A)  failed  to  surface  any  papers 
directly  related  to  the  basic  training  problem.  However,  the  literature  review  did  locate 
papers  on  related  resource  scheduling  problems  that  generated  ideas  for  attacking  the 
basic  training  problem. 

For  example,  Yang  and  Ignizio  [13]  solve  a  (somewhat)  related  military  resource 
scheduling  problem  in  two  phases  using  two  different  heuristics;  an  approach  that  is 
(coincidentally)  similar  to  the  three-phase,  two-heuristic  method  presented  in  Chapter  4 
for  solving  the  basic  training  problem.  However,  Yang  and  Ignizio  deal  with  the  problem 
of  scheduling  daily  training  activities  for  a  fixed  number  of  Army  baualions^  located  at 
the  same  installation  where  (1)  the  type  and  quantity  of  resources  to  support  training  are 
constrained,  (2)  precedence  relationships  exist  between  tasks,  and,  in  some  cases,  (3)  two 
or  more  battalions  must  work  together  to  accomplish  training  tasks  and  must  share 
resources  as  well.  The  problem  described  by  Yang  and  Ignizio  contains  tens  of  millions 
decision  variables;  they  consider  their  problem  to  be  NP-complete  and  beyond  the  scope 
of  exact  solution  methods.  Therefore,  Yang  and  Ignizio  propose  a  heuristic  method  to 
determine  suboptimal  training  activity  schedules  (by  battalion)  that  minimize  the  total 
time  required  for  the  battalion(s)  to  complete  daily  training  tasks  for  a  finite  plaiming 
horizon.  Training  activity  and  resource  scheduling  is  accomplished  in  two  phases.  In 
phase  one,  a  "greedy"  scheduling  algorithm  makes  a  single  pass  through  the  plaiming 
horizon  to  obtain  an  initial  feasible  schedule  that  is  free  of  scheduling  conflicts  (e.g.,  no 
violations  of  task  precedence  and  no  unresolved  demands  for  training  resources).  Phase 


-An  Army  battalion,  in  general,  consists  of  five  companies;  a  headquarters  company,  a  service  support 
company  and  three  "line"  companies.  Basic  training  battalions  generally  consist  of  a  headquarters 
company  and  five  basic  training  companies. 
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two  improves  the  initial  schedule  using  a  "search  and  reshuffle"  exchange  heuristic  that 
reduces  the  number  of  days  required  to  execute  the  training  program. 

Another  related  problem,  first  introduced  in  1958  by  Manne  [9]  and  Wagner  and 
Whitin  [12],  is  the  economic  lot-sizing  problem  (ELSP)  for  material  requirements 
planning  (MRP)  in  manufacturing  and  inventory  processes.  See  Afentakis  [1],  Afentakis 
et  al.  [2],  Billington  et  al.  [4],  Blackburn  and  Millen  [5],  Crowston  et  al.  [6],  Graves  [8] 
and  Zangwill  [14]  for  extensions  to  the  basic  ELSP  and  different  methods  for  solving  the 
problem.  In  the  economic  lot-sizing  problem,  items  are  produced  in  batches  (lots)  to 
meet  demand  in  the  current  and  future  periods.  As  demand  depletes  inventory,  more  of 
the  item  must  be  periodically  produced  to  prevent  a  stock-out.  In  the  Wagner  and  Whitin 
version,  a  single  item  is  produced  to  meet  known  demand  for  each  period  of  a  finite 
planning  horizon,  where  the  objective  is  to  minimize  the  total  cost  of  setting  up  for 
production,  production,  and  inventory  holding.  Problem  constraints  include  no 
backorders  (i.e.,  a  nonnegative  production  constraint  signifying  no  backlogging  of 
production  to  satisfy  unmet  demand),  and  zero  inventory  on-hand  at  the  start  and  end  of 
the  planning  horizon.  A  balance  equation  accounts  for  inventory  carried  forward  to  meet 
future  demand.  Wagner  and  Whitin  solved  their  problem  using  dynamic  programming 
(DP).  The  DP  algorithm  makes  one  pass  through  the  planning  horizon  to  determine  an 
optimal  production  policy  that  minimizes  total  cost.  The  amount  to  produce  in  each 
period  is  fixed,  so  the  solution  is  completely  specified  by  the  sequence  of  productions 
(e.g.,  the  production  policy)  that  determine  whether  or  not  to  set-up  and  produce  in  each 
period. 

In  some  ways,  the  basic  training  problem  is  similar  to  the  basic  economic  lot¬ 
sizing  problem.  The  aggregated  view  of  initial  entry  training  in  Figure  2  is  equivalent  to 
a  two-phase  (BCT-AIT)  production  process  where  the  training  companies  in  each  phase 
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are  treated  as  a  single  group  of  parallel  servers.  The  "output"  of  the  first  phase  (BCT)  is  a 
single  item  (BCT  graduates).  In  the  BCT  phase,  both  training  company  strengths  (batch 
sizes)  and  training  cycle  lengths  (production  lead  times)  vary  from  period-to-period,  but 
are  constrained  (see  Modeling  Constraints  in  Section  2.3).  The  second  phase  (AIT)  is  a 
multi-item  problem,  where  each  AIT  program  produces  a  different  type  of  "product"  (AIT 
graduate),  and  both  company  strengths  and  training  cycle  lengths  may  vary  across 
installations,  and  within  an  installation  as  well. 

Despite  the  similarities,  there  are  major  differences  between  the  tw'o  problems 
which  prevent  ELSP  solution  methods  from  being  applied  directly  to  the  basic  training 
problem.  For  example,  the  objective  of  the  economic  lot-sizing  problem  is  to  find  a 
production  schedule  that  meets  demand  by  minimizing  production  costs  under  certain 
assumptions  and  constraints.  However,  the  objective  currently  formulated  for  the  basic 
training  problem  seeks  to  maximize  the  "quality"  of  training  in  each  period  of  the 
planning  horizon  (see  Instructor-to-Student  Ratio  above). 

Demand  also  discriminates  between  the  two  problems.  In  the  ELSP,  demand  is 
measured  by  external  demand  for  the  item  being  produced.  An  equivalent  demand  in  the 
basic  training  problem  would  be  demand  for  recruits  by  military  commanders  to  fill 
vacancies  in  their  units.  However,  demand  in  the  basic  training  problem  is  actually 
measured  by  the  requirement  for  training  resources  as  determined  hy  the  time-varying 
(seasonal)  arrival  of  new  recruits  to  start  basic  training  each  period. 

Finally,  the  decision  elements  of  the  two  problems  are  different.  In  the  Wagner 
and  Whitin  version  of  the  ELSP,  the  problem  is  to  determine  whether  or  not  to  produce  a 
fixed  amount  of  the  item  in  each  period  to  meet  demand.  Howe\  er,  there  are  three 
interdependent  decision  elements  in  the  basic  training  problem  that  can  take  on  a  range  of 
values  in  each  period.  The  decision  elements  are: 
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1 .  company  strengths  that  vary  from  1 50  to  250  recruits  per  company; 

2.  training  cycle  lengths  that  eire  generally  ten  weeks  but  ma>'  be  shortened  by 
one  or  two  weeks  to  correct  a  training  company  shortfall;  and 

3.  the  number  of  training  companies  lost  {deactivated)  or  gained  (if  any)  in  each 
period  representing  a  change  to  the  training  base  structure. 

2.3  Mathematical  Formulation  of  the  Basic  Training  Problem 

Mathematical  notation  for  the  problem  is  as  follows. 
j :  year  of  the  planning  horizon,  j  e.  {1,2,...,/}; 

t  :  week  of  a  given  year  y,  /  €  {l,2,...,I}}  where  7}  is  the  number  of  weeks  in 
yeary; 

6(0:  recruit  show  rate  for  week  t  where  0  <  6(f)  <  1; 

p{ty.  relative  frequency  distribution  of  recruit  arrivals  over  week  t  of  any  year; 
Dj :  total  number  of  training  companies  deactivated  in  year  y ; 

Mj :  number  of  training  companies  available  at  the  beginning  of  year y ; 

Rj\  recruiting  objective  for  year  y  determined  by  Department  of  the  Army; 
dj  (f ) :  number  of  training  companies  to  deactivate  in  week  f  of  year  y; 

Vj  (f) ;  estimated  number  of  recruits  that  show  up  for  training  in  week  t  of  year  y; 
Xj  (f ) ;  strength  of  training  companies  starting  in  week  t  of  year y ; 

X :  upper  bound  for  training  company  strength; 

X :  lower  bound  for  training  company  strength; 
yj  (f ) :  training  cycle  length  for  companies  starting  in  week  t  of  >  ear y ; 

Y :  upper  bound  for  training  cycle  length; 
y :  lower  bound  for  training  cycle  length; 
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D*  (t) :  balance  of  training  companies  left  to  be  deactivated  as  of  week  t  of  year  y; 

Ij{ty.  number  of  idle  training  companies  at  the  beginning  of  week  t  of  year  y; 

I :  upper  bound  for  the  idle  training  company  constraint. 

Modeling  Assumptions 

The  major  modeling  assumptions  included  in  the  formulation  of  the  basic  training 
problem  are: 

•  finite  planning  horizon  of  Tj  equal  periods; 

•  varying  but  bounded  training  company  strengths; 

•  varying  but  bounded  training  cycle  lengths; 

•  no  backlogging  of  recruits,  or  equivalently,  no  backlogging  of  the 
requirement  for  training  companies; 

•  changes,  if  any,  to  the  number  of  training  companies  available  for 
training  are  constrained  to  decreases  {deactivations)  in  training 
companies. 

Modeling  Constraints 


X  <  Xj{t) 

<  X: 

company  strength  constraint; 

(1) 

VI 

<  Y: 

training  cycle  constraint; 

(2) 

Mj  >  Dy. 

deactivation  scenario  constraint; 

(3) 

djiO  >  0 

V  (r,y): 

company  deactivation  constraint; 

(4) 

o 

lA 

^  /  V(f,y): 

problem  feasibility  constraint. 

(5) 
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Modeling  Relationships  and  Equations 

The  expected  number  of  new  recruits  to  arrive  for  training  in  week  t  of  year  j  is 

rj{t)  ^  mpit)Rj,  (6) 

where  the  recruiting  objective  Rj,  determined  by  the  Department  of  the  Army  (DA),  is 

greater  than  the  number  of  new  soldiers  required  to  meet  the  needs  of  the  Army  in  a  given 
yearj. 

The  number  of  training  companies  that  remain  to  be  deactivated  as  of  week  t  of 
year  j  is  determined  according  to 

D](t)  =  Dj  -  f  dj(i),  (7) 

i=  1 

where  Dj  =  dj(t).  (8) 

t=  1 

The  expected  number  of  companies  to  begin  training  in  week  t  of  year  j  is  given 
by 


rj{t) 

ipy 


A 

(9) 


where  it  is  important  to  note  that  Xj(t)  is  the  company  strength  decision  to  take  in 
week  t. 

The  number  of  companies  to  become  available  in  week  t  having  just  completed  a 
training  cycle  that  began  I  e  L  weeks  earlier  is 
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y 

/  e  L 


(10) 


The  possible  values  for  /  in  any  week  t  are  within  the  set  L  e  {  (10),  (10,9),  (10,9,8)  }, 
where 


L  =  (10)  =  indicates  only  one  group  of  training  companies  becomes 

available  in  week  t  of  year y;  the  group  that  just  completed  a  normal  10-week 
training  cycle. 

L  =  (10,9)  indicates  two  groups  of  training  companies  become  available  in  week 
t  of  year y;  those  finishing  10-week  and  9-week  training  c}  cles.  The  group 
of  training  companies  finishing  a  9-week  training  cycle  had  its  cycle 
compressed  by  one  week  (see  Section  2.1  for  an  explanation  of  compressing- 
the-load). 

L  =  (10,9,8)  =  indicates  three  groups  of  training  companies  become 

available  in  week  t  of  year  y;  those  finishing  10-week,  9-week,  and  8-week 
cycles.  Therefore,  the  training  cycles  for  training  companies  completing  9- 
and  8-week  training  cycles  were  compressed  by  one  and  two  weeks, 
respectively. 

A  balance  equation  accounts  for  the  number  of  idle  companies  carried  forward  to 
week  t+1  of  year y  to  meet  demand  for  training  companies.  An  idle  training  company  is 
one  that  is  available  but  not  required  to  start  training  at  the  beginning  of  a  week.  The 
balance  equation  for  idle  training  companies  is  given  by 
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=  Ij(t)  +  E 

/  e  L 


Xj{t-l) 


rjjt) 

Xjit) 


djit). 


(11) 


The  number  of  idle  companies  at  the  end  of  a  week  is  always  carried  fonvard,  available  to 
meet  the  training  requirement  in  the  next  week. 

Although  training  companies  is  integer- valued,  computing  (11)  may  generate  a 
non-integer  result  due  to  computing  (9)  and  (10).  To  overcome  this  problem,  (9)  and  (10) 
are  rounded  down  when  Xj{t)  <  X,  and  are  rounded  up  to  the  next  whole  training 

company  when  Xy(r)  =  X.  This  creates  two  cases  for  computing  (1 1): 


Case  1 :  Xj(*)  <  X, 


+  =  Ijit)  + 

Case  2:  Xj(*)  =  X, 

Ij{t  +  1)  =  Ij(t)  + 


E 

I  €  L 


I 

/  e  L 


rj{t) 

Xj(t-l) 

Vjit-l) 

1 - 

J _ 

Xj(t-l) 

Xjit) 

dJit). 


(12) 


(13) 


When  the  current  company  strength  constraint  is  tight  (i.e.,  an  equality  constraint) 
at  the  upper  bound,  then  the  fractional  part  of  the  training  compam'  is  rounded  up,  as 
denoted  by  the  ceiling  operator  f*"].  When  training  companies  are  at  full  strength,  those 
recruits  that  are  represented  by  the  fractional  part  of  a  training  compan>-  can  only  begin 
basic  combat  training  if  an  additional  training  company  is  scheduled  to  start.  In  all  other 
cases,  the  fractional  part  may  be  dropped,  as  denoted  by  the  floor  operator  L*J,  since,  in 
general,  sufficient  training  spaces  will  be  available  in  training  companies  not  filled  to 
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capacity  to  absorb  the  recruits  represented  by  the  fractional  part  of  a  training  company. 
For  simplicity,  the  floor  and  ceiling  operators  of  (12)  and  (13),  respectively,  (denoting  the 
rules  for  rounding  training  companies)  will  not  be  repeated  for  every  future  reference  to 
training  company  computations,  such  as  in  (11).  However,  it  is  to  be  understood  that 
these  rules  are  in  effect  throughout  the  dissertation  unless  stated  other\\ise. 

The  rounding  rules  given  here  for  estimating  the  number  of  training  companies 
required  to  train  recruits  each  week  are  consistent  with  the  rounding  rules  used  in 
practice.  However,  the  practice  of  "redistributing"  the  number  of  recruits  represented  by 
the  "dropped"  fractional  part  of  a  training  company  when  Xj(t)  <  X  fails  to  account  for 
the  true  company  strength  which  is  larger  than  the  prescribed  This  leads  to 

inaccuracies  in  the  estimates  of  (1)  training  "quality"  (as  measured  b>'  the  instructor-to- 
student  ratio;  see  Section  2.1)  and  (2)  training  program  costs  (based  on  variable  costs  per 
trainee).  Therefore,  even  though  we  do  not  expect  radical  changes  in  the  estimates  of 
either  of  these  two  performance  measures,  we  recommend  that  future  modeling  efforts 
account  for  the  redistribution  of  recruits  as  a  means  of  improving  performance  measure 
estimates. 

The  conservation  of  training  companies  equality  constraint  is 


ijd)  +  I 

/  G  L 


Xjit-l) 


+ 


k  =0 


rjit-k) 

Xjit-k) 


(14) 


where 


^min  1  r-{t-k) 

A  Xjit-k) 


and  /  •  =  min  {/  €  L},  represents  the  number  of  "busy" 


training  companies  in  week  t.  The  summation  notation  counts  "busy"  training  companies 
from  the  current  week  ^  (k  =  O),  backward  in  time,  to  week  t  -I +  l  {k  =  I where  I 


in  any  week  t  represents  the  training  cycle  length  decision  /  (see  (10)  above)  from  the  set 

^  rAt-l) 

L  e{  (10),  (10,9),  (10,9,8)  }  for  £ 

iTl 

The  training  "quality"  measure  for  one  period,  denoted  by  the  instructor-to- 
student  ratio  for  week  t  of  year  j  and  assuming  one  instructor  per  training  company  for 
simplicity,  is  given  by 


1 

Xjit) 


(15) 


The  following  training  quality  performance  index  is  used  for  both  the  objective 
function  criterion  in  the  optimal  decision  model  of  Chapter  3,  and  to  compare  competing 
feasible  training  resource  schedules  generated  by  the  heuristic  methods  of  Chapter  4: 


1 


(16) 


We  note  that  the  following  variables  and  parameters  are  integers: 
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3.  OPTIMAL  DECISIONS  USING  DYNAMIC  PROGRAMMING 
In  practice,  satisfactory  training  resource  schedules  for  the  basic  training  problem 
presented  in  Chapter  2  are  obtained  from  heuristic  methods  (see  Section  4.1)  by  making 
scheduling  decisions  sequentially  using  "current"  information.  Exact  solution  methods 
for  obtaining  optimal  scheduling  decisions  include  (1)  integer  or  mixed  integer  programs, 
(2)  complete  enumeration,  and  (3)  dynamic  programming.  The  potential  combinatorial 
complexity  that  may  be  encountered  in  using  these  procedures  to  solve  the  basic  training 
problem  is  briefly  discussed  below.  The  main  reasons  for  choosing  dynamic 
programming  as  the  optimal  solution  method  for  the  problem  are  also  gi\  en. 

3.1  Complexity  of  the  Basic  Training  Optimal  Decision  Problem 

The  complexity  of  using  an  integer  (or  mixed  integer)  program  (IP)  to  solve  the 
basic  training  problem  of  Chapter  2  may  be  illustrated  by  estimation  of  the  number  of 
basic  variables  required  for  one,  of  perhaps  many,  possible  IP  problem  formulations.  For 
this  illustration  we  define  the  following  notation:  time  period  t  e(r  =  96),  training 
company  i  e{l  =  130)  and  training  cycle  length  y  €{1,2,3}  that  correspond  to  training 
cycle  lengths  of  8,  9,  and  10  weeks,  respectively.  The  basic  variables  for  determining  the 
strengths  of  the  training  companies  (to  be  determined  by  the  model)  that  begin  a  training 
cycle  of  length  j  in  each  period  t  are  as  follows: 

the  number  of  recruits  per  training  company  i  at  time  t,  where  is 
integer  and  is  bounded  according  to  equation  (1)  of  Chapter  2; 

g  fl,  if  in  t,  company  i  is  started  with  a  training  cycle  length  of  j,  and 
[O,  otherwise. 
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An  integer  programming  formulation  for  this  problem  generates  approximately  5  x  10“^ 
((130x96)  +  (130x3x96))  basic  integer  variables.  Attempting  to  solve  a  problem  of  this 
size  using  integer  programming  methods  could  be  a  daunting  task,  if  not  a  prohibitive 
one. 

Another  way  to  solve  this  problem  is  to  completely  enumerate  all  possible 
decision  sequences  of  company  strengths  Xj{t)  and  training  cycle  lengths  yj{t),  and  then 

select  the  optimal  sequence  as  the  one  that  maximizes  our  training  "quality"  objective 
function  (see  equation  (16)  of  Chapter  2).  This  decision  sequence  for  the  real-world 
planning  horizon  of  96  periods  is  denoted  by 

|xi(l),  Xj(2),...,X2(96);  | 

[yj(l),  yi(2),...,y2(96)  }' 

An  (upper  bound)  estimate  of  the  possible  number  of  decision  sequences  that  may  be 
enumerated  for  the  real-world  problem,  based  on  101  company  strength  values  and  3 
training  cycle  length  values,  is  (101x3)^^,  or  approximately  1.65  x  10"^^.  Even  though 
many  of  these  decision  sequences  are  infeasible  according  to  the  problem's  state 
feasibility  constraints  (see  (5)  of  Chapter  2),  complete  enumeration  of  the  problem  is  not 
possible. 

A  combinatorial  explosion  of  the  state  space  also  occurs  when  attempting  to 
obtain  an  optimal  solution  to  the  real-world  basic  training  problem  using  dynamic 
programming.  For  example,  the  training  cycle  lengths  of  the  basic  training  problem 
create  a  situation  where  not  all  the  information  that  is  needed  (required)  to  make 
decisions  in  the  "current"  period  can  be  summarized  in  the  state  variable  Ij{t)  that 

describes  the  evolution  of  the  basic  training  system.  To  make  the  necessary  time-lagged 
information  available  for  decision  making  in  the  current  period,  the  framework  of  the 
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basic  training  problem  must  be  reformulated  using  the  technique  of  state  augmentation 

t<5ep,  Ser.tinn  3.3  for  detail.sT  The  augmented  state  space  for  a  single  neriod  of  the  basic 

training  problem  (for  1988  training  data)  requires  enumeration  of  the  following  state 

variables: 

•  number  of  idle  training  companies  7^  (f  + 1) : 

•  company  strength  values  Xj{t  -  \),...,Xj{t  -  9) 

130; 

for  the  augmented  state: 

(101)^; 

•  training  cycle  lengths  yy(0: 

This  generates  an  (upper  bound)  estimate  of  the  size  of  the  state  space,  for  each  period  t, 
of  3jc130x(101)^,  or  approximately  4.27x10^°  (427  million  trillion)  possible  states. 
Although  dynamic  programming  substantially  reduces  the  amount  of  enumeration 
required  to  obtain  an  optimal  solution  by  (1)  avoiding  decision  sequences  that  cannot 
possibly  be  optimal  and  (2)  solving  the  problem  one  stage  at  a  time,  the  potential  size  of 
the  augmented  state  space  for  the  real-world  problem,  or  for  a  reduced  problem  (see 
Section  3.5  and  Table  7  of  Section  6.2)  remains  quite  large.  Complete  enumeration  of  the 
augmented  state  space  for  the  real-world  problem  is  not  possible. 

The  essential  difference  between  the  integer  (or  mixed  integer)  programming 
approach  and  dynamic  programming  is  that  with  the  integer  programming  method  only 
one  optimal  decision  sequence  is  ever  generated.  In  dynamic  programming,  however, 
many  optimal  decision  sequences  (plus  the  objective  function  value  for  each  sequence) 
are  generated  for  whatever  initial  state  and  initial  decision  are  specified.  With  dynamic 
programming  it  is  also  possible  to  obtain  partial  scheduling  solutions  for  "unexpected" 
state  conditions  that  may  occur  at  any  stage  of  the  planning  horizon  by  appealing  to 
Bellman's  Principle  of  Optimality  [3,  p.l2].  Partial  scheduling  solutions  are  generated  by 
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(1)  "re-initializing"  the  problem  with  the  unexpected  state  and  decision  for  the  stage 
where  the  unanticipated  result  occurred,  and  (2)  using  the  pre-computed  optimal 
scheduling  decisions  for  successive  stages  to  obtain  the  optimal  decision  sequence  for  the 
remaining  periods  of  the  planning  horizon.  These  situations  carmot,  in  general,  be 
handled  by  integer  programming  in  a  straight  forward  manner.  Additionally,  DP  is  well 
suited  to  modeling  the  dynamics  of  the  basic  training  system,  and  if  stochastic  variables 
are  introduced  for  recruit  arrivals  and  recruit  failures  (in  future  studies),  then  dynamic 
programming  is  the  only  method,  in  general,  that  can  be  used  for  sequential  decision 
making.  For  the  reasons  given  above,  the  decision  was  made  to  formulate  an  optimal 
decision  model  for  the  basic  training  problem  using  dynamic  programming. 

3.2  The  Basic  Training  Optimal  Decision  Problem 

The  dynamic  programming  (DP)  formulation  of  the  basic  training  problem 
follows  Bertsekas'  [3]  formulation  of  an  inventory  control  problem.  The  dynamic  system 
for  the  inventory  control  problem  is  a  model  fitting  the  general  form  of 

Ut,  Wt),  r  =  l,2,...,r-l,  (17) 

where  the  planning  horizon  is  divided  into  T  time  periods.  Notation  for  the  model  is  as 
follows; 


t\  index  for  T  identical  discrete  time  periods; 

Xj :  state  variable  that  summarizes  information  needed  for  optimization  decisions; 
represents  the  quantity  of  some  item  on-hand  at  the  beginning  of  period  t  in 
the  inventory  control  problem; 
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u^:  decision  variable  denoting  the  decision  made  in  period  t  (e.g.,  the  amount  of 
the  item  to  produce  or  order  in  period  t ); 

random  parameter  denoting  a  disturbance  that  perturbs  the  system.  In  the 
inventory  control  problem  represents  the  level  of  random  demand  in  period 
t  for  an  item,  and  is  specified  by  a  probability  distribution. 


Specifically,  the  state  of  the  inventory  control  system  evolves  according  to  the 
balance  equation 


'■r+l 


=  Xj  +  -  Wj. 


(18) 


The  inventory  control  problem  is  solved  by  determining  the  sequence  of  functions 
Pj(Xj),  for  t  =  l,2,...,r-l,  that  map  the  current  state  to  production  decisions  to 
meet  random  demand  w^,  with  the  objective  of  minimizing  a  specified  performance 
measure  of  system  costs  in  each  period  t.  The  sequence  of  functions  that  map  states  to 
decisions  (actions),  thus  constituting  an  inventory  control  policy  ,  is  gi\  en  by 

TT  =  {  Pj,  P2>-"’P-7'_i  }• 

The  random  parameter  makes  cost,  in  general,  a  random  variable.  denotes  the 
expected  cost  of  the  inventory  system  over  the  planning  horizon  (see  Bertsekas  [3],  12). 
When  the  system  costs  are  additive  over  time  the  total  expected  cost  is  summarized  by 


7t  G  n 


•^71  (-^’j)  “  rnin  I  k 

“1=1 


t=\ . r-i 


(19) 
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where  system  cost  incurred  at  each  stage  t  of  the  inventory  control  problem  includes 
setup  cost,  production  cost  and  inventory  holding  cost,  and  is  denoted  by 
gf[x^,  wj.  The  cost  in  the  last  stage  T,  denoted  by  gj.{Xj),  assumes  that  no 

production  or  ordering  decision  is  made  in  that  stage. 

The  set  of  possible  inventory  levels  x^  at  each  stage  belongs  to  an  inventory  space 
denoted  by  I^,  and  the  production  decisions  belong  to  a  nonempty  subset  of  feasible 
production  decisions  of  the  complete  space  of  production  decisions  (P^.  The 

notation  1)^  (x^ )  means  that  the  elements  belonging  to  depend  on  the  state  x^  and  the 
period  t.  If,  for  all  t,  x^  and  eV^(x^),  then  the  inventor}^  control  policy 

for  stage  t  is  said  to  be  admissible.  The  set  of  all  admissible  inventory  control 
policies  is  denoted  by  IT . 

The  optimal  decision  problem  is  to  determine  the  admissible  inventory  control 
policy  7t*  that  minimizes  the  cost  functional  (19),  for  any  initial  state  Xj.  The  optimal 

inventory  control  policy  is  denoted  by 

♦  J  ♦  *  *  1 

7C  =  1  Pp  P2— Pr-1  1’ 

and  the  corresponding  optimal  cost  function  is 

7,(Xj)  =  min  J^{x{). 

t:  e  n 


In  this  case,  the  model  of  the  inventory  control  problem  must  be  formulated  so  that  the 
inventory  control  policies,  minimize  expected  costs,  however,  in  the  case 

of  "rewards,"  the  objective  function  is  maximized. 

Although  the  structure  of  the  basic  training  problem  is  similar  in  some  ways  to  the 
inventory  control  problem  described  above,  there  are  significant  differences  between  the 
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two  problems  as  well.  Bertsekas  [3]  presents  a  general  dynamic  programming 
formulation  of  the  inventory  control  system  that  evolves  according  to  (18).  However, 
formulating  a  model  of  the  basic  training  problem  within  the  framework  of  dynamic 
programming  requires  making  preliminary  assumptions  concerning  the  structure  of  the 
basic  training  problem.  To  begin,  we  remove  the  training  company  deactivation  decision 

from  the  DP  formulation  of  the  basic  training  problem  by  requiring  training  company 
deactivation  decisions  dj{t)  to  be  made  prior  to  implementing  the  DP  algorithm.  Other 

important  elements  of  problem  structure  are  discussed  below. 

Stages 

In  the  basic  training  problem,  stages  are  specified  by  week  i  and  year  j.  The 
planning  horizon  Tj  consists  of  a  finite  number  of  identical,  discrete  time  periods  where  t 

e  {l,2,...,ry}  andy  €  {1,2,. ..,7}. 

Problem  Decisions  and  Scheduling  Policy 

In  the  basic  training  problem,  company  strength  Xj{t)  and  training  cycle  length 

jjit)  decisions  are  made  at  the  beginning  of  period  f  for  t  =  1,  2,...,  - 1  for  all  training 

companies  that  begin  training  that  period.  A  sequence  of  such  decisions  is  represented  by 

j  x^(l),  Xj(2),...,x^(r-1);  I 
|yj(l),yi(2),...,y/r-l)  J' 

The  decision  spaces  for  Xj{t)  and  Jjit)  are  denoted  by  Q  and  A,  respectively, 
where  Xj{t)  eQ  and  yj{t)  e  A.  The  decision  spaces  Q  and  A  consist  of  the  bounded 
sets  of  integers  specified  by  the  company  strength  constraint  x  <  Xj{t)  <  X  and  the 
training  cycle  constraint  y  <  yj{t)  <  Y.  The  subsets  of  feasible  decisions  to  take  at 
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each  stage  t,  Xj(t)  and  yj(t),  are  denoted  by  Xj  t,  /y(0]  c:  Q  and  Xj  t,  ci  A, 

where  feasible  decision  elements  belonging  to  these  two  subspaces  depend  on  both  the 
stage  t  and  the  state  Ij  (t)  of  the  basic  training  system;  see  below. 

State  of  the  System 

The  state  of  the  basic  training  system  is  characterized  by  the  number  of  idle 
training  companies  I.{t)  that  are  available  at  the  beginning  of  each  period  t  to  start 

training  new  recruits. 

State  Transition  Equation 

The  state  of  the  basic  training  system  evolves  according  to  the  following  balance 
equation  for  idle  training  companies: 


1  ^  t-At-l)  rAt) 

/,(r.i)  =/,[.  /,(0. .,«)]  =  /,«)  fry 

I  ^  L  J  J 

where  fj  t,  I  jit),  Xjit)  is  explicitly  defined  as  an  equivalent  representation  of  the  right 
hand  side  of  (20). 


The  ratio  — —  is  an  estimate  of  the  number  of  companies  to  begin  training  m 

Xjit) 

week  t  of  year  j,  where  Vj  (?)  gives  the  expected  number  of  recruits  to  arrive  for  training 


r- it -1) 

each  week.  The  sum  V  X. -  represents  the  number  of  companies  that  become 

available  at  the  beginning  of  week  t+1  to  start  (another)  training  cycle  having  just 
completed  one  that  began  either  eight,  nine,  or  ten  weeks  earlier  (see  Section  2.1  for  a 
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discussion  of  compressing-the-load).  The  possible  values  for  /  e  L  are  contained  within 
the  set  L  €  {  (10),  (10,9),  (10,9,8)  }  (see  equation  (10),  Section  2.3,  for  further 
details). 


To  maintain  the  integer  value  of  +  both 


Xj{t) 


and  2] 

/  =  L 


are 


rounded  as  follows  (see  Section  2.3  for  further  details): 


For  Xj{*)  <  X: 


Ij(t  +  1)  =  I  jit)  + 


VoxXji*)  =  X\ 


//r  +  1)  =  I  jit)  + 


rjit) 

1 

1 

xl’i 
_ 1 

_ 1 

[y 

1 - 

1  e  L  Xjit-l) 

Xfit) 

djity, 


djit). 


Demand 

In  the  basic  training  problem,  demand  for  idle  training  companies  is  a  function  of 
the  number  of  new  recruits  Vjit)  to  report  for  training  each  week  t.  In  the  real-world 

basic  training  problem,  Vj  it)  is  a  random  parameter.  However,  as  explained  previously  in 
Section  2.1,  recruit  arrivals,  Vjit),  is  estimated  ahead  of  time  for  each  week  t  and  year  j  of 
the  planning  horizon  Tj,  given  an  annual  recruiting  target  Rj  for  each  year  j.  The 
absence  of  random  disturbances  in  the  basic  training  problem  makes  this  version  of  the 
problem  completely  deterministic:  another  major  distinction  between  the  basic  training 
problem  and  the  inventory  control  problem  where  demand  is  a  random  parameter 
specified  by  a  probability  distribution. 
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However,  from  (20)  we  note  that  requirement  for  training  eompanies,  as 
^iit) 

represented  by  — — ,  also  depends  upon  Xjit),  the  deeision  to  take  in  period  t.  This 

Xj  (t) 

dependence  is  illustrated  by  means  of  two  simple  examples.  In  both  of  these  examples, 
superscripts  1  and  2  denote  the  two  cases  we  are  comparing. 


Example  1 :  Dependence  of  Training  Company  Demand  on  Recruit  Arrivals. 


Given:  rj(r)  =  3000,  rj{t)  =  4000  and  Xj{t)  -  200. 

Demand[rj{t)\  =  ^  =  15; 

Demand\rj{t)\^  =  — —  =  20. 

Xj  it) 

Example  2:  Dependence  of  Training  Company  Demand  on  Company  Strength  Decision. 
Given:  Vjit)  =  3000,  xj(0  =  150  and  Xy  (?)  =  250. 

=  20; 

=  12. 


Demand  xj  it)  = 


rjjt) 

xj(r) 


Demand  x 


xjit)_  = 


rjit) 

xjit) 


Objective  Function 

In  dynamic  programming,  the  sequential  decision  process  seeks  to  optimize  an 
appropriately  chosen  objective  function  over  the  entire  planning  horizon  T.  For  the  basic 
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training  system,  the  problem  is  to  find  the  optimal  scheduling  policy  -K*  that  yields  an 
optimal  objective  function  value  J  ,  for  any  fixed  initial  condition  7,(1),  over  all 

admissible  (feasible)  candidate  policies  7i  in  IT,  the  set  of  all  admissible  scheduling 
policies.  for  the  basic  training  problem  is  defined  in  a  manner  similar  to  the  general 

cost  function  given  in  (19)  with  one  major  difference.  The  absence  of  a  random 
parameter  in  the  basic  training  problem  (see  Demand  above),  makes  it  possible  to 
optimally  solve  the  problem  by  making  the  proper  choice  of  company  strength  Xj  {t)  and 

training  cycle  length  yj{t)  decisions  for  each  period  t,  and  for  any  initial  condition, 

without  computing  the  expected  value.  In  this  (deterministic)  case,  the  optimal  company 
strength  and  training  cycle  length  decisions  for  each  possible  state  of  the  basic  training 
system  can  be  pre-computed  for  each  stage  so  that  it  is  not  necessar}'  to  "wait"  until  the 
"current"  period  to  determine  the  decision  to  take  (in  that  period)  (see  [3,  p.9]). 

For  the  deterministic  basic  training  problem,  optimizing  the  objective  function 

over  all  admissible  sequences  of  decisions  leads  to  the  same  optimal  objective  function 
value  7  *  as  optimizing  over  all  admissible  system  control  policies  {  u,,  } 

(following  the  notation  of  the  inventory  control  problem).  For  the  sake  of  brevity,  the 
notation  given  above  for  policies  will  be  used  for  admissible  decision  sequences  denoted 
by 


|yi(l),yi(2),...,y/r-l)  J 


and  the  set  of  all  such  sequences  will  be  also  denoted  by  H.  The  optimal  sequence  of 
decisions  n*  is  the  one  that  maximizes  the  basic  training  objective  function  (see  (21) 
below)  for  a  fixed  initial  state  7^  (1)  where 
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For  the  basic  training  problem,  the  appropriateness  of  an  objective  function 
depends,  in  large  measure,  upon  the  goals  of  the  analysis  and  the  needs  of  decision 
makers.  Two  objective  functions  are  considered  here. 

First,  training  program  managers  may  be  interested  in  obtaining  an  optimal 
training  schedule  that  maximizes  the  quality  of  training.  In  this  case,  an  appropriate 
objective  function  is  one  that  incorporates  a  measure  of  training  quality.  As  discussed 
previously  in  Section  2.1,  one  measure  of  training  quality  that  accounts  for  practical 
aspects  of  scheduling  training  companies  is  the  instructor-to-student  ratio.  When  each 
training  company  is  of  equal  size  (see  Section  2.1,  Dynamics  of  Varying  Company 
Strength),  then  maximizing  the  instructor-to-student  ratio  (i.e.,  minimizing  company 
strengths)  is  equivalent  to  minimizing  idle  training  companies.  If  we  assume  one 
instructor  per  training  company  for  simplicity,  then  for  each  sequence  of  decisions 
7t  e  n,  a  corresponding  value  for  J^,  which  provides  a  measure  of  quality  to  be 

maximized,  is  given  by 

4[  VD]  =  |‘ 

One  possible  objective  that  is  an  alternative  to  maximizing  the  quality  of  training 
(via  the  instructor-to-student  ratio)  is  to  minimize  the  cost  of  basic  training.  For 
simplicity,  only  four  types  of  training  costs  are  considered  in  illustrating  this  objective. 
Notation  for  the  formulation  of  the  cost  function  is 


variable  cost  per  idle  training  company; 
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\  fixed  "Setup"  cost  assessed  whenever  companies  begin  a  training  cycle; 
C  :  variable  training  cost  per  active  traimng  company; 

C^\  Variable  training  cost  per  recruit; 

Pj{t)\  indicator  variable,  where  Pjit)  e  {O,  l}.  Pj(t)  is  zero  when  no  training 
companies  start  training  in  period  t,  and  one  otherwise. 

For  this  formulation,  the  problem  is  to  determine  the  number  of  companies  (and 
corresponding  training  company  strengths)  to  start  training  each  week  (if  any)  that 
minimizes 


/^/^(O)]  =  C^Ij(T)  +  C^^PjiT)  +  C^VjiT) 


*  E 

f=0 


C^IAt)  +  C^^PAt)  +  —  +  C^rAt) 

J  J  Xj(t)  J 


(22) 


subject  to  the  system  constraint  Ij{t  +  1)  =  fj  t,  Ijit),  XjAt)  ,  where  t  =  0,1, - 1 


1 

/.  t.  Lit),  X  At)  is  explicitly  defined  by  (20),  and  — —  is  rounded  according  to  the 
L  J  J  i  Xjit) 


rules  given  above  (see  State  Transition  Equation). 


3.3  State  Augmentation 

In  the  inventory  control  problem,  past  actions  are  summarized  by  the  state 
variable  which,  together  with  current  demand,  is  the  necessary  information  needed  in 

period  t+1  to  make  the  ordering  or  production  decision.  Unfortunately,  in  the  basic 
training  problem,  company  strength  Xj  it)  and  cycle  length  jj  it)  decisions  depend  upon 
past  information  that  cannot  be  summarized  in  Ij  it  + 1)  alone.  This  is  due  to  the 
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dependence  of  decisions  made  in  the  current  period  on  past  decisions  on  strengths  and 
cycle  lengths  of  training  companies;  see  (20). 

To  illustrate  the  interdependence  of  company  strength  decisions,  assume  for  the 

moment,  that  cycle  lengths  are  fixed  at  two  periods,  creating  a  single  period  lag  in  the 
dynamic  model  (here  yj(t)  =  2  and  is  therefore  not  a  decision  variable).  Then  the  state  of 

the  basic  training  system  evolves  according  to  the  following  balance  equation 


+  =  Ijit)  + 


rj(t) 

xj(iy 


(23) 


From  (23),  we  see  that  Ij(t)  of  the  simplified  system  does  not  provide  sufficient 
information  to  enable  the  company  strength  decision  Xj(t)  to  be  made  in  period  t,  or  to 
compute  (23).  From  previous  discussion  (see  Sections  2.1,  2.3  and  3.1),  rj(t)  and 
are  knovwi,  and  the  decision  Xj(t)  depends  (in  this  case)  upon  the  decision 
Xj(t  —  1)  from  the  previous  period.  To  illustrate  the  dependence  of  the  current  decision  in 
time  t  on  the  (past)  decision  in  t-1,  consider  the  following  example,  wiiere  superscripts  1 
and  2  denote  the  two  cases  that  are  being  compared. 


Given:  r^.(0  =  2000,  rj(t  - 1)  =  3000,  x\{t  - 1)  =  150  and  xj{t  - 1)  =  250. 
Demands  for  training  companies  for  the  two  cases  are: 


r.(r-l) 

- =  20  and 

x](t-l) 


xj(t-l) 


=  12. 


Substituting  these  values  into  (23)  yields  following  two  cases: 
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Case  1 :  l]{t +  1) 


/j(r)  + 


rj(t) 

x](t) 


/j(0  +  20- 


2000. 

x]{ty 


Case  2:  lj(t  +  l) 


lj(t)  + 


rj^ 

x](t) 


/y(0  +  12- 


2000 

x]{t)' 


Thus  the  decisions  in  the  current  period,  Jcj(0  and  Xy(0>  obviously  depend  upon 
decisions  jcj(t-l)  and  made  previously  in  period  t-1.  In  other  words,  the 

structure  of  the  illustrative  problem  induces  a  one-period  time  lag  in  the  state  of  the 
system.  Therefore,  in  general,  decisions  made  in  period  t-1  impact  both  the  future  state 
of  system  in  period  t  and  the  allowable  range  of  feasible  decisions  for  period  t.  In  order 
to  apply  dynamic  programming  to  a  problem  of  this  type,  it  is  necessary  to  reformulate 
the  problem  using  a  method  called  state  augmentation  [3].  This  method  makes  it 
possible  to  include  information  from  previous  periods  by  augmenting  the  state  space  with 
the  past  information,  as  needed,  to  make  the  decision  Xj{t)  in  the  current  period  t. 

Therefore,  for  the  illustrative  problem,  an  additional  state  variable 
Sjit)  =  Xj{t  - 1),  t  - 1,  is  introduced  to  form  the  augmented  state  problem 


Ij(t  +  V) 

5/r  +  l) 


fj  t,  Ij{t),  Xj(t),  Sj{t) 


Xjit) 


r.(f-l) 
+  - — 
J  Sj{t) 

Xj{t) 


rj(t) 

Xj{t) 


,  (24) 


where  Sj{l)  =  XjiO)  is  the  (last)  company  strength  decision  from  the  "previous"  plaiming 
horizon.  Defining  the  new  (augmented)  state  of  the  system  as  Ij  (t)  =  Ij  (t),  Sj  (t)  j ,  and 


substituting  Ij  (t)  into  (24)  yields  the  reformulated  problem  without  a  time  lag: 
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(25) 


where  fj  t,  Xj(t)  is  defined  by  the  right  hand  side  of  (24). 


In  terms  of  the  original  variables,  and  for  a  general  one-stage  reward  function 
gy  [t,*  ,*],  the  recursive  DP  algorithm  is  given  by  i 


SIEPJL 


Jj[t,Ij(T),Xj(t-i)]  =  ^^[r,//r),x/r-i)], 

STEP  2: 

Xj_  {T  -  2),  Xj  (T  - 1)]  = 

max  {gj[T-lIjiT-l),Xj(T-2),Xj(T- 1)] 

Xj(T-l)  bXj[T-UIj(T-1),Xj(T-2)] 

+  Jj  [r,  fj  (r  - 1,  Ij  (T  - 1),  Xj  (T-2),  Xj  (T  -  d),  Xj  {T  - 1)]  }, 


and  proceeding  in  this  manner, 
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STEP  t. 


=  max  {gj[t,Ij{t),Xj{t-l),Xj 

Xj(0 


t  +  1,  fj  (t,  Ij  (t),  Xj  (t  - 1),  Xj  (t)),  Xj  (0  }, 


for  t  =  -2  and  j  =  . 


Now,  we  consider  the  more  complicated  case  where  training  cycle  length  is  fixed 
at  y^.(t)  =  10  creating  a  nine-period  time  lag.  Including  additional  variables  to  the 

problem  can  significantly  increase  both  the  number  of  computations  required  to  generate 
an  optimal  solution,  and  the  amount  of  computer  memory  required.  Therefore,  the  state 
space  is  augmented  by  only  the  minimum  number  of  variables  necessary  to  make  a 
decision  in  each  period.  We  begin  by  (initially)  adding  only  one  state  variable  to  the 
problem,  5^  (t)  =  Xj{t-9).  The  augmented  system  becomes 


■//r  +  1) 
s]it  +  r) 


I  jit),  Xjit),  5j(0] 
Xjit-S) 


I  jit)  + 


rjit-9) 

s]it) 

Xjit-8) 


rjjt) 

Xj  it) 


.  (26) 


However,  since  Xjit-8)  is  not  stored  in  either  ^j(r)  or  I  jit),  then  additional 
information  on  Xj  it  -  8)  is  required  by  the  decision  model.  Therefore,  a  second  state 
variable  5^  it)  =  Xj  it  -  8)  is  added  which  further  augments  the  system: 
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r  r/r-9)  r.(0  1 

r  H  r  r  ill  — i -  “  - 

Ijit  +  l)  S^t)]  ■'  5](0  X.{t) 

s](t  +  l)  =  sj(t)  =  sj(t)  .  (27) 

sj(t  +  l)  Xj(t-7)  Xjit-1) 

Similarly,  from  (27)  additional  information  on  Xj  (t  -  1)  is  required.  Continuing 
in  this  fashion,  the  minimally  augmented  system  is  given  by 


One  can  think  of  {  5'j(r),  }  as  "registers"  for  temporarily  storing  the 

required  information  as  the  system  evolves. 

Denoting  Ij  (t)  =  (t),  (t),  sj  (t),...,  (t),  ,  then  the  reformulated  system 

is  Ij{t  +  1)  =fj  t,  J:y(0]»  where  fj[t,*,*]  represents  the  right  hand  side  of  (28). 
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The  DP  algorithm  for  the  nine-period  time  lag  problem  is  given  below  in  Section  3.4  (see 
DP  Algorithm). 

Multiple  training  cycle  lengths  may  be  incorporated  into  the  model  by  augmenting 
the  state  of  the  system  with  additional  variables  denoting  the  values  of  yy(r-lO), 
and  yj(t-%).  Using  the  state  augmentation  technique  to  account  for  past 
training  cycle  length  decisions  follows  the  same  state  augmentation  steps  explained  above 
for  company  strengths. 

3.4  DP  Model  of  the  Basic  Training  Problem 

We  summarize  here  the  DP  model  for  the  problem  with  a  nine-period  time  lag, 
where  the  training  cycle  length  decision  y^it)  is  removed  from  the  decision  model  by 
fixing  the  training  cycle  lengths  at  the  upper  bound  yy(r)  =  10.  The  problem  is  to 
optimize  an  objective  function,  denoted  by  /*,  by  determining  the  optimal  company 
strength  policy  n*  =  {j:*(1),  xl(2),...,  J[:}(r-1)}.  The  problem  formulation  and  the 
DP  algorithm  given  below  are  expressed  in  terms  of  the  new  state  Ij{t). 

Objective  Function: 

maximize  7^[^1,  /i(l)] 

7t  e  n 

Subject  to: 

a)  Decision  Variable  Constraints: 

X  <  Xj{t)  <  X: 


training  company  strength; 
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b)  State  Variable  Constraints: 


+  =  fj  t,  state  transition; 


Ij(t)  <  /  V  (t,j):  constraint  on  the  number  of  idle  companies; 

Ij(t)  >  0  V  (tj):  problem  feasibility  constraint; 


ij(t),  Xj(t),  rj(t). 


rj(t) 

Xj(t)  J 


integer. 


DP  Algorithm : 


STEP  1: 


Jj[t,  lj(T)]  =  s,[t.  Ij(d], 


STEP  2: 


max 


+  J I 


T,fj(T-l,ij{T-l),  x/r-l))]  }, 
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STEP  t: 

i 

max  {  gj [r,  Ij  {t),  Xj  (?)]  +Jj  t  + 1,  fj (/,  Ij  (r),  Xj  (o)  }, 

Xj{t) 

for  t  =  -  2  and  j  = 

3.5  Simplifying  the  Augmented  State  Space  of  the  Problem 

An  unfortunate  consequence  of  state  augmentation  is  the  exponential  explosion  of 
the  state  space.  Practical  limitations  of  computing  technology  constrain  the  size  of  the 
problem  that  can  be  solved  via  dynamic  programming.  As  discussed  in  Section  3.1, 
implementing  the  DP  algorithm  for  the  augmented  basic  training  problem  (for  1988 
training  data)  may  (potentially)  require  enumeration  of  3xl30x(101)^,  or  approximately 
4.27x10^®  possible  states,  which  is  not  possible. 

However,  simplifications  to  the  problem  structure  may  lead  to  a  reduction  in  the 
size  of  state  space  enabling  optimal  scheduling  decisions  from  DP  to  be  compared  with 
heuristic  solutions  to  the  simplified  problem.  For  example,  the  number  of  admissible 
company  strength  values  may  be  reduced  by  restricting  intermediate  values  between  the 
lower  and  upper  bounds  to  be  multiples  of  a  predetermined  step  size.  Using  a  company 
strength  step  of  five,  the  number  of  company  strength  values  to  enumerate  in  each  stage  is 
reduced  from  101  to  21.  Furthermore,  tests  of  realistic  recruiting  targets  using  heuristic 
scheduling  methods  (see  Chapter  4)  reveal  that  there  are  never  more  than  fifty  idle 
training  companies  in  any  period  for  the  set  of  training  companies  tested  and  training 
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cycle  lengths  of  ten  weeks,  resulting  in  the  constraint  I.{t)  <  50.  These  two 
assumptions  significantly  reduce  the  size  of  the  state  space  for  each  period  to 
3x50j:(21)^,  or  approximately  1.19x10^^  (119  trillion)  possible  states.  This  is  still  a 
very  large  problem,  but  six  orders  of  magnitude  smaller  than  the  original  problem. 
Results  comparing  the  dynamic  programming  solutions  to  heuristic  solutions  for  a 
(further)  simplified  problem  are  given  in  Section  6.2. 
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4.  HEURISTIC  APPROACHES 

An  alternative  approach  to  the  dynamic  programming  formulation  of  the  problem 
is  a  heuristic  solution  procedure.  The  heuristic  approach  presented  here  consists  of  two 
different  heuristic  procedures  applied  sequentially  in  three  phases.  The  heuristics  have 
been  implemented  in  a  fully  operational  computer-based  decision  support  system  (DSS) 
(see  Chapter  5)  for  generating  "good"  training  resource  schedules  in  reasonable  time  (see 
Results,  Chapter  6).  Both  heuristics  feature  an  automated  policy  improvement  algorithm 
motivated  by  the  policy  improvement  step  of  dynamic  programming  (see  Bertsekas)  that 
is  tailored  to  fit  the  unique  structure  of  the  basic  training  problem.  Automated  policy 
improvement  eliminates  the  need  for  an  analyst  to  interactively  schedule  resources;  a 
notable  improvement  over  existing  computer-supported  manual  methods  where 
scheduling  is  done  by  trial-and-error. 

The  two  heuristics  of  the  solution  procedure  have  been  implemented  in  a 
spreadsheet  called  LOTUS  1-2-3  for  Windows,  Release  4\  the  same  software  environment 
used  previously  for  implementing  the  heuristics-used-in-practice  (HUIP)  (see  Preliminary 
Work,  Section  5.1).  In  addition,  advanced  macros  provide  programming  flexibility  for 
fully  implementing  the  heuristic  procedures. 

4. 1  Overview  of  the  Heuristic  Approaches 

Phase  I  starts  with  an  initial  training  requirement  for  each  week  t,  denoted  by 
{  rj(l),  rfT),...,rjiT)  },  that  is  estimated  from  the  initial  recruiting  target  Rj  for  each 
year  j.  An  efficient  single-pass  heuristic  (SPH)  makes  one  forward  pass  through  the 
planning  horizon  applying  a  policy  iteration  algorithm  a  finite  number  of  times  in  each 
period  t  until  an  initial  feasible  training  resource  schedule  is  obtained  (if  one  exists)  for 
the  currently  available  resources.  The  training  resource  scheduling  policy  in  Phase  I  is 
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the  sequence  of  decisions  on  company  strength  Xj(t)  and  training  cycle  length  yj{t)  for 
each  period  t.  The  policy  is  specified  by 


ylm,yl(2) . )'){r-i) 


where  superscript  1  denotes  Phase  I. 

Phase  II  considers  options  for  changing  the  level  of  resources  (e.g.,  deactivating 
training  companies)  available  to  train  recruits,  and  is  motivated  by  recent  decisions  to 
dovmsize  the  training  installation  complex.  Incorporating  this  aspect  of  training  base 
management  into  the  heuristic  approach  enables  different  types  of  problems  to  be  studied, 
such  as  evaluating  how  decisions  that  change  the  training  base  structure  impact  training 
resource  scheduling.  Options  for  deactivating  training  companies,  specified  by 
{  rfj(l),  d-^0-),...,dj(T)  },  include  cutting  training  companies  either  at  the  beginning  or 
the  end  of  a  year,  or  distributing  the  cuts  across  the  planning  horizon.  Our  example  is 
restricted  to  one  type  of  training  resource  (i.e.,  training  companies),  and  to  decisions  that 
reduce  the  level  of  available  resources.  However,  the  model  is  easily  modified  to  also 
consider  resource  level  increases  and  multiple  reusable  resources. 

Phase  II  adds  another  decision  variable  (the  number  of  training  companies  to  be 
deactivated  dj{t)  in  each  period  t),  and  another  state  variable  (the  number  of  training 

companies  remaining  to  be  deactivated  D*  for  year  j)  to  the  problem.  Once  model 

parameters  are  adjusted  to  reflect  actual  or  potential  changes  to  the  training  base,  the 
single-pass  heuristic  is  used  (again)  to  return  a  feasible  resource  schedule  (if  one  exists) 
for  the  currently  available  resources.  If  no  resource  changes  are  needed,  then  Phase  II 
may  be  omitted.  The  resource  scheduling  policy  for  Phase  II  is 
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n 


2 


where  df(t)  is  the  company  deactivation  decision  in  period  t,  and  the  superscript  2 
denotes  Phase  II. 

As  explained  earlier,  the  single-pass  heuristic  (SPH)  used  in  Phase  I  and  Phase  II 
makes  one  sequential  forward  pass  through  the  planning  horizon  to  correct  training 
company  shortfalls  in  each  period,  and  then  stops.  Experiments  have  shown  that  it  is 
possible  to  improve  resource  schedules  obtained  via  SPH  by  making  additional  passes 
through  the  planning  horizon  using  a  modified  policy  improvement  step  to  further 
decrease  training  company  strengths.  This  observation  led  to  the  development  and 
implementation  of  a  multi-pass  heuristic  (see  Section  4.3).  The  multi-pass  heuristic 
(MPH)  that  improves  the  resource  scheduling  policies  obtained  fi’om  the  single-pass 
heuristic. 

Phase  III  uses  the  initial  feasible  schedule  from  Phase  II  (or  fi-om  Phase  I  if  Phase 
II  is  omitted)  as  its  starting  point.  The  initial  company  strength  scheduling  policy  is 
iteratively  revised,  period-by-period,  using  MPH  that  works  sequentially  backward 
through  the  planning  horizon  until  no  further  improvements  to  the  objective  function  are 
possible  with  the  MPH.  The  final  resource  scheduling  policy,  obtained  at  the  completion 
of  Phase  III,  is  given  by 
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71^ 


■  y^(lXy^(2),...,yj(r-l); 
d^(l),d^(2),...,dj(T-l) 


where  xj  (!)  denotes  the  "best"  company  strength  decision  obtainable  in  period  t  using 
the  backward  MPH  recursion  of  Phase  III,  and  y^j(t)  and  d^(t)  are  the  training  cycle  and 
training  company  deactivation  decisions  from  Phase  II. 

The  heuristic  procedures  presented  here  are  based  on  the  mathematical 
formulation  of  the  basic  training  problem  presented  in  Chapter  2,  and  assume  a  96-week 
(two-year)  planning  horizon.  Company  strengths  Xj  (t)  are  initialized  at  the  lower  bound 

of  150  recruits.  This  establishes  a  utopian  bound  from  above  for  the  "training  quality" 


performance  measure  (i.e.,  the  instructor-to-student  ratio),  ^  - ,  of  0.64  (which  is 

unachievable,  in  general). 

The  logical  flows  of  the  single-  and  multi-pass  heuristics  are  diagrammed  below 
in  Figures  5-9.  After  the  flow  diagrams,  a  step-by-step  explanation  of  the  procedures  of 
the  two  heuristics  is  also  given. 


4.2  Single-Pass  Heuristic  (SPH) 

Before  Phase  I  begins,  the  training  base  scenario  is  tested  to  determine  whether  a 
feasible  schedule  exists  for  the  scenario  (see  Figure  5).  This  is  done  by  initializing 
training  company  strengths  at  their  upper  bound  and  training  cycle  lengths  at  their  lower 
bound.  If  the  initial  scheduling  shows  that  the  feasibility  constraint 
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>  0  V  (tj))  holds  for  each  period  of  the  planning  horizon,  then  a  feasible 
schedule  exists. 

Phase  I  seeks  an  initial  feasible  schedule  (if  one  exists)  using  two  decision 
variables  (company  strength  Xj(t)  and  training  cycle  length  yj(t)),  and  one  state  variable 
(the  number  of  training  companies  idle  at  the  beginning  of  each  week  Ij  {t)).  The  initial 

conditions  for  Phase  I  assume: 


1 .  an  annual  recruiting  target  Rj  for  each  year  j\ 

2.  company  strengths  Xj{t)  initialized  at  the  lower  bound  of  150  recruits  per 
company; 

3 .  training  cycle  lengths  jj  (?)  initialized  at  the  upper  bound  of  ten  weeks; 

4.  no  changes  to  the  number  of  available  training  companies  to  train  recruits  are 
considered  {djit)  =  0  V  (?,7));  and 

5.  information  needed  to  compute  /j(l),  /j(2),...,/j(9)  is  given  in  the  form  of 


E 

/  e  L 


rjjt-l) 

Xjit-iy 


for  each  case. 


Figure  5  shows  the  logical  flow  of  the  Resource  Scheduling  Algorithm  of  Phase  I. 
Figure  6  diagrams  the  single-pass  heuristic  Company  Strength  Policy  Improvement  Step, 
and  Figure  7  outlines  the  steps  for  the  training  cycle  adjustment  procedure  followed  when 
attempting  to  correct  a  training  company  shortfall  that  could  not  be  corrected  via 
company  strength  policy  improvement. 
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Heuristic 

Policy 

Improvement 

Algorithm 

(continued) 


Apply: 

Cycle  Length  Policy  Adjustment  Step 
to  correct  company  shortfall  in  current  week 


Figure  7.  Phase  I  Training  Cycle  Length  Policy  Adjustment 


PHASE  I:  Finding  an  Initial  Feasible  Training  Resource  Schedule 


1.  Initialize  Phase  I:  set  JCy(r)  =  X,  yj{t)=Y,  and  dj{t)  =  0  V  (tj). 

2.  Compute  weekly  recruit  arrivals /y(0  =  p{t)  Rj  V  (tJ)  given  Rj. 


3. 


Compute  companies  required  in  each  week 


rjjt) 

Xj{t) 


V  {tJ)  as  follows 


For  Xj{t)  <  X: 


rj{t) 

Xjit) 


63 


For  Xj{t)  =  X: 


rj(t) 

Xj{t) 


4.  RESOURCE  SCHEDULING  ALGORITHM: 

A.  Start  the  RESOURCE  SCHEDULING  ALGORITHM  in  week  1  of  year  1. 

B.  Designate  this  week  as  the  current  week  i  . 

C.  Get  the  cycle  length  yj(t)  for  the  current  week  t  . 

D.  Compute  the  next  start  week  for  the  companies  that  begin  a  training  cycle  in 
week  f  as  of  week  f  +  yy(f). 


E. 


Get 


rjji) 

Xj{t) 


for  week  f . 


F. 


Schedule  companies 


to  be  available  to  start  again  in  week  t  +  jj  (t) . 


IF  week  t  +  yj(t)  <  Ty, 

THEN  go  to  week  t ,  move  forward  one  period  in  the  planning 
horizon  from  week  t  of  year  J  to  week  f  + 1,  and  return  to 
STEP  4B.  Otherwise 
CONTINUE. 


IF  week  t  +  yj{t)  >  Ty 

THEN  the  RESOURCE  SCHEDULING  ALGORITHM  has 
completed  scheduling  for  the  entire  planning  horizon. 


CONTINUE. 
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^  rAt-l)  rAt) 

5.  Compute:  +  =  //O  H-  -  —  -  rf/f)  V  (tj). 

6.  HEURISTIC  POLICY  IMPROVEMENT  ALGORITHM  via  AUTOMATED 
POLICY  ITERATION: 

A.  Start  the  POLICY  IMPROVEMENT  ALGORITHM  in  week  1  of  year  1. 

B.  Proceed  sequentially  forward  through  each  week  of  the  planning  horizon  to 
the  first  (or  next)  week  where  there  is  a  training  company  shortfall  (e.g., 
shortfall  week:  SFW),  such  that,  I  jit  + 1)  <  0. 

C.  Designate  this  week  as  the  shortfall  week  t^p^. 

IF  week  t^p^  +  \>Ty, 

STOP;  the  POLICY  IMPROVEMENT  ALGORITHM  has  been 
applied  to  each  week  of  the  planning  horizon  resulting  in  an 
initial  feasible  schedule  tc^  .  Proceed  to  Phase  II  (or  to  Phase 
III  if  Phase  II  is  omitted).  Otherwise 
CONTINUE. 

D.  COMPANY  STRENGTH  POLICY  IMPROVEMENT  STEP: 

IF  1 1  {t^py  + 1)  =  0 ; 

THEN  return  to  STEP  6B.  Otherwise 

CONTINUE. 

IF  ^jit^py^r  "I"  1)  0  and  ^ 

THEN  increase  company  strength  by  one  step  of  size  n,  replace 
XjitpFw^  by  Xjit^py)  +  n,  and  return  to  STEP  6D.  Otherwise 


CONTINUE. 
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IF  IjiisFw  + 1)  <  0  mid  Xj(t^p^)  =  X  and  week  -  /  >  1,  where 
/  e  {  1  }j 

THEN  proceed  sequentially  backward  from  week  + 1  to  the 
first  (or  next)  week  F^^  -  i,  where  i  e  {  1,...,F^^  "  ^  1 
where  -i)<X,  such  that  i  takes  on  values 

sequentially  beginning  with  /  =  1.  Otherwise 

CONTINUE. 

IF  the  current  week  F^^  -  /  <  1 ; 

THEN  further  adjustments  to  company  strengths  cannot  be  made, 
and  it  is  not  possible  to  correct  the  training  company  shortfall 
in  week  F^^  + 1  via  the  COMPANY  STRENGTH  POLICY 

IMPROVEMENT  STEP. 

CONTINUE. 

E.  CYCLE  LENGTH  POLICY  ADJUSTMENT  STEP: 

Attempt  to  correct  the  training  company  shortfall  in  week  F^^  + 1  by 

0 
0 

where  L  e{  (10),  (10,9),  (10,9,8)  }  (see  equation  (10),  Section  2.4,  for 
details). 

IF  adjusting  I  eL  makes  I + 1)  ^  0; 

THEN  return  to  STEP  6B.  Otherwise 


adjusting  training  cycle  lengths  according  to  /  6  L  in  ^ 


0^ 


SFW 


X,{tr 


I  E  L  ■^i^‘‘SFW 
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CONTINUE. 

IF  + 1)  <  0  and  ^ j(.^sFW^  —  X  V  f  s  {  }  and 

^  =  ^MAX  for  week  f^^  +  1; 

THEN  further  adjustments  cannot  be  made  to  either  company 
strengths  or  training  cycle  lengths,  and  it  is  not  possible  to 
correct  the  training  company  shortfall  in  week  + 1 . 

STOP;  the  resource  scheduling  policy  for  the  current 
training  scenario  is  infeasible. 

Phase  II  assumes  the  following  information  from  Phase  I: 

1 .  recruiting  target  Rj ; 

2.  training  scenario  {  rj(l),  r^(2),...,rj(T)  }; 

3.  initial  scheduling  policy  7t^ 

The  Resource  Scheduling  Algorithm  and  the  Policy  Improvement  Algorithm  (Steps  4  and 
6,  respectively)  from  Phase  I  are  applied  (again)  in  Phase  II  to  find  a  feasible  training 
resource  schedule  Figure  8  diagrams  the  single-pass  heuristic  for  Phase  II. 
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PHASE  II: 

Heuristic 

Policy 

Improvement 

Algorithm 


Initialize: 

1 .  set  company  size  to  lower  bound  (LB) 

2.  set  cycle  length  to  upper  bound  (UB) 

3.  select  a  strategy  {dj(t)}  for  making 
training  company  cuts  Dj 


Figure  8.  Phase  II  Single-Pass  Heuristic  Policy  Improvement  Algorithm 
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PHASE  II:  Finding  a  Feasible  Training  Schedule  Subject  To  Changes  in  Training 
Resource  Availability. 

1 .  Given  Dj ,  apply  a  strategy  {  d^ (1),  d^(2),...,dj{T)  }  for  reducing  training 
companies. 

2.  Apply  the  RESOURCE  SCHEDULING  ALGORITHM  from  PHASE  I. 

I 

3 .  Apply  the  HEURISTIC  POLICY  IMPROVEMENT  ALGORITHM  from 

PHASE  1. 

4.  PHASE  II  STOPPING  RULES: 

IFD;(0  =  0  and  V  (rj); 

STOP;  company  deactivation  is  complete  and  a  feasible  resource 
scheduling  policy  7i^  exists  for  the  current  training  scenario. 
Otherwise 

CONTINUE. 

IF  D*{t)  >  0  and  3  week  9  Ijit^py/  + 1)  <  0  and  further 

adjustments  cannot  be  made  to  either  company  strengths  (e.g.,  Xj  (t)  =  X 
for  all  r  G  {  })  or  to  cycle  lengths  (e.g.,  I  =  for  week 

THEN  it  is  not  possible  to  correct  the  training  company  shortfall  in  at 
least  one  week,  namely  + 1 . 

STOP;  the  resource  scheduling  policy  for  the  current  training 


scenario  is  infeasible. 
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4.3  Multi-Pass  Heuristic  (MPH) 

The  starting  point  for  Phase  III  assumes  a  feasible  scheduling  policy  from  either 
Phase  I  or  Phase  II,  or  from  some  other  heuristic  scheduling  method.  The  multi-pass 
heuristic  starts  in  week  Tj  of  the  planning  horizon  and  proceeds  backward  in  time  to  each 
week  i  where  Ij{t)  >  0  and  Xj(t)  >  x.  In  each  week  i  ,  the  company  strength  Xj{i)  is 
decreased  by  one  step  of  size  n  at  a  time  until  Ij{t)  <  0.  After  each  company  strength 
decrement,  the  MPH  checks  each  week  of  the  planning  horizon  {  i  }  to  ensure  the 
last  iteration  of  the  policy  improvement  algorithm  did  not  cause  the  current  policy  to 
become  infeasible  (e.g.,  for  some  t  £ {  }).  The  first  time  that 

decreasing  Xj(t)  causes  Ij(t)  <  0  in  some  week  t  },  then  the  last  decrement 

is  undone  by  increasing  company  strength  in  week  t  by  one  step  of  size  n  to  make 
This  process  is  repeated,  step-by-step  and  period-by-period,  until  no  further 

improvements  can  be  made  to  the  company  strength  policy,  resulting  in  the  "best" 
scheduling  policy  obtainable  via  the  MPH  in  Phase  III.  Figure  9  outlines  the  steps  of 
the  Multi-Pass  Heuristic  Policy  Improvement  Algorithm  for  Phase  HI,  that  is  (again) 
followed  by  the  step-by-step  procedures  for  the  MPH. 


Figure  9.  Phase  III  Multi-pass  Heuristic  Policy  Improvement  Algorithm 


PHASE  III:  Resource  Schedule  Policy  Improvement  via  Multi-Pass  Heuristic 

1 .  Start  in  week  T  of  year  J. 

2.  Proceed  sequentially  backward  through  the  planning  horizon  from  week  T  to  the 
first  (or  next)  week  t  ,  where  Ij (t)  >  0  and  Xj (t)>  x. 

IF  the  current  week  {tj)  <  (1,1); 

THEN  the  MPH  COMPANY  STRENGTH  POLICY 

IMPROVEMENT  ALGORITHM  has  been  applied  to  the  entire 
planning  horizon. 

STOP;  the  policy  is  the  best  one  obtainable  using  the  MPH. 
Otherwise 

CONTINUE. 

3 .  COMPANY  STRENGTH  POLICY  IMPROVEMENT  ALGORITHM  via 
MULTI-PASS  HEURISTIC 

A.  Decrease  company  strength  by  one  step  of  size  n  in  week  f  fromA:y(f)to 
Xj(t)-n. 

B.  Verify  schedule  feasibility  by  checking  each  week  t  },in 

sequence,  proceeding  forward  through  the  planning  horizon  from  week  t  to 
the  end  of  the  planning  horizon  at  week  Tj . 

IF//?)>0  V  f  }; 

THEN  return  to  week  i  in  the  model  and  return  to  STEP  1  of  Phase 
III  (MPH).  Otherwise 
CONTINUE. 


}  3  //o<0; 

THEN  return  to  week  t  in  the  model,  increment  Xjii)  by  one  step  of 
size  It  to  make  lj(i)>0  ,  and 


RETURN  to  STEP  1. 
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5.  DECISION  SUPPORT  SYSTEM  (DSS) 

The  logistical  complexities  of  the  Army's  initial  entry  training  program  create 
numerous  practical  decision  making  problems.  Many  of  these  decision  situations  involve 
problems  where  the  best  solutions  are  not  obvious.  This  may  be  due  to  multiple 
competing  objectives,  or  situations  where  outcomes  depend  upon  a  sequential  decision 
process  complicated  by  precedence  constraints,  or  the  need  to  evaluate  the  impact  of 
decisions  over  varying  planning  horizons. 

The  decision  support  system  presented  here  is  a  robust  paradigm  for  carefully 
modeling,  studying,  and  methodically  solving  problems  related  to  the  Army's  Basic 
Combat  Training  (BCT)  program.  For  this  reason,  the  decision  support  system  is  being 
referred  to  as  the  Decision  Support  System  for  Army  Basic  Combat  Training  Resource 
Management  per  Year'^,  or  ARMY.  Despite  the  current  Army  BCT  orientation  of  the 
system,  it  is  believed  that  the  DSS  can  be  extended  to  other  Army  training  programs,  and 
to  training  programs  of  other  branches  of  military  service  (Navy,  Air  Force,  Marines)  as 
well.  The  ARMY  system  has  been  designed  to  support  decision  making  at  three  major 
levels  of  basic  training  management:  strategic  planning  at  the  Department  of  the  Army 
(DA)  level,  training  installation  management  at  the  U.S.  Anny  Training  and  Doctrine 
Command  (TRADOC)  Headquarters,  and  operational  control  of  the  training  program  at 
the  training  installation  level. 

At  the  Army's  highest  level.  Department  of  the  Army,  strategic  planners  project 

future  recruiting  objectives  to  ensure  that  the  (future)  force  is  properly  manned.  The 

determination  of  recruiting  objectives  leads  to  recruiting  targets,  which  in  turn  drive 

requirements  for  training  resources.  Issues  relating  to  military  force  structure  complicate 

the  strategic  planning  process.  For  example,  the  continued  infusion  of  new  technologies 

1  Courtesy  of  Professor  Emmanuel  Femdndez-Gaucherand,  dissertation  coadvisor,  of  the  Department  of 
Systems  and  Industrial  Engineering  at  the  University  of  Arizona. 


into  Anny  systems  may  lead  to  the  design  of  new  training  programs,  or  to  substantive 
changes  to  curriculum  in  order  to  meet  future  needs  for  soldiers  with  more  highly 
developed  technical  skills.  In  either  case,  potential  changes  to  course  lengths  and  class 
sizes  of  the  training  program  may  directly  impact  recruitment,  training  program 
tliroughput,  training  resource  requirements,  and  training  program  costs.  Downsizing  the 
military  may  also  cause  significant  changes  to  the  structure  of  the  training  installation 
complex,  such  as,  the  consolidation  of  Army  training  programs,  or  the  creation  of  "joint" 
training  centers  for  training  recruits  from  all  branches  of  service  at  the  same  installation. 

At  TRADOC  Headquarters,  training  program  managers  acquire  and  distribute 
training  resources  needed  by  training  installations  to  accomplish  DA  training  missions. 
Management  issues  include  (1)  determination  of  training  loads  for  each  training 
installation,  (2)  justification  of  training  resource  requirements  and  costs  to  DA,  and  (3) 
conducting  studies  and  preparing  plans  for  dealing  with  special  training  contingencies 
(e.g.,  mobilization)  or  potential  changes  to  training  programs  (e.g.,  base  closures, 
downsizing,  etc.). 

At  the  operational  level,  training  program  managers  resolve  resource  scheduling 
problems  that  impact  the  efficiency  and  effectiveness  of  training  program  execution. 
Training  execution  problems  confronting  training  program  managers  at  the  operational 
(installation)  level  include  (1)  developing  precise  training  resource  schedules  that  meet 
training  requirements,  (2)  identifying  and  resolving  training  resource  shortfalls,  and  (3) 
determination  of  efficient  resource  utilization. 

The  ARMY  system  currently  supports  analysis  of  the  strategic,  mid-level,  and 
operational  issues  described  above.  For  example,  studying  varying  course  lengths  using 
ARMY  is  easily  done  since  course  length  is  a  model  parameter  that  may  be  specified  in 
ARMY  hy  "clicking"  on  the  cycle  length  "button"  of  the  user  input  screen  and  entering  the 
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course  length  (value)  from  the  keyboard.  The  ARMY  system  also  features  the  capability 
for  estimating  cost  alternatives  (see  Section  5.4)  based  on  solutions  generated  via  the 
single-  and  multi-pass  heuristics  of  Chapter  4. 

With  the  ARMY  system,  it  is  possible  to  easily  model  both  training  base  changes 
(as  indicated  by  varying  levels  of  basic  training  companies)  and  force  structure  changes 
(as  indicated  by  varying  recruiting  objectives)  since  available  training  companies  and 
recruiting  objectives  are  model  parameters  that  can  be  easily  changed  by  the  user. 
Appendix  B  illustrates  how  the  ARMY  system  might  be  used  to  support  analysis  of 
potential  changes  to  the  training  base  structure. 

The  DSS  offers  several  performance  measures  for  evaluating  training  base  and 
force  structure  scenarios  that  support  the  decision  making  process,  such  as  (1)  training 
program  quality  (via  the  instructor-to-student  ratio),  (2)  training  program  costs,  (3) 
training  program  throughput,  and  (4)  the  level  (quantity)  of  resources  required  to  satisfy 
training  objectives. 

Finally,  provides  training  program  managers  at  training  installations  with  a 

fully  automated  computer-based  DSS  for  generating  "good"  weekly  schedules  of  reusable 
training  resources  in  reasonable  time.  The  have  (potentially)  high  practical  value  as  a 
preliminary  step  in  developing  executable  training  resource  schedules  for  day-to-day 
operations. 

5.1  Preliminary  Work 

The  ARMY  system  is  based,  in  part,  on  past  work  by  McGinnis  [10]  for  the  U.S. 
Army  Training  and  Doctrine  Command  Headquarters.  In  1987,  Congress  established  the 
Base  Realignment  and  Closure  Commission  (BRAC)  to  examine  the  military  installation 
complex,  determine  the  feasibility  of  consolidating  or  realigning  military  missions,  report 
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findings,  and  make  recommendations  to  Congress  for  partial,  or  complete,  closure  of 
military  bases.  In  response  to  the  Commission,  TRADOC  Headquarters  initiated  a 
project  to  develop  computer-based  methods  for  studying  alternatives  (called  training  base 
scenarios)  to  re-structure  and  re-engineer  the  training  installation  complex,  and  to  assess 
the  likely  impact  of  such  changes  on  TRADOC's  ability  to  meet  future  training  missions, 
and  to  maintain  training  readiness.  The  project,  completed  in  1989,  consisted  of  a 
computer-supported,  three-level  modeling  framework,  henceforth  referred  to  as  the  1989 
DSS,  for  studying  training  base  scenarios. 

The  three  levels  of  the  modeling  framework,  long-term  aggregate,  mid-term 
aggregate,  and  near-term  disaggregate,  represent  distinct  top-down  views  of  the  initial 
entry  training  process.  The  planning  horizons  are  four,  two,  and  one  year(s),  respectively. 
The  aggregate  view  of  initial  entry  training  models  training  companies  from  all 
installations  as  a  single  group  (see  Figure  2),  whereas,  the  disaggregate  view  models  each 
training  installation  and  training  program  separately  (see  Figure  3). 

Long-term  aggregate  analysis  of  training  base  scenarios  begins  by  estimating  the 
annual  recruit  throughput  per  training  company  for  each  phase  of  initial  entry  training 
(e.g.,  the  total  number  of  recruits  to  be  trained  by  an  initial  entry  training  company  in  a 
given  year).  Next,  the  number  of  training  companies  required  to  train  new  recruits  each 
year  (based  on  forecasted  annual  recruiting  targets)  is  estimated  as  the  annual  recruiting 
target  divided  by  the  annual  company  throughput.  Then,  the  number  of  training 
companies  available  to  train  recruits  is  estimated  using  "best-case"  assumptions  (e.g., 
training  company  cuts  are  always  made  at  year's  end,  and  company  strengths  for  surge 
and  nonsurge  periods  are  250  and  200  recruits  per  company,  respectively).  Comparing 
the  estimated  number  of  required  versus  available  training  companies  determines  the 
feasibility  of  a  training  scenario  (for  the  assumptions  used).  If  the  number  of  companies 
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required  is  greater  than  the  number  of  companies  available  in  any  year  of  the  four-year 
planning  horizon,  then  the  training  base  scenario  is  infeasible.  Although  not  a 
sophisticated  method,  long-term  aggregate  analysis  proved  to  be  very  useful  for  quickly 
and  safely  eliminating  infeasible  training  base  scenarios  from  further  consideration. 

In  mid-term  aggregate  analysis,  separate  models  of  each  training  phase  (BCT, 
AIT,  OSUT)  are  used  to  generate  a  weekly  training  schedule  via  heuristics-used-in- 
practice  (HUIP).  If  the  scheduling  process  leads  to  a  training  company  shortfall  that 
camiot  be  corrected,  then  the  training  schedule  and  the  corresponding  training  base 
scenario  are  infeasible.  However,  because  training  companies  are  aggregated  across  all 
installations  in  mid-term  aggregate  analysis,  it  is  not  possible  to  identify  where  (at  which 
installations)  the  training  company  shortfalls  occur.  Therefore,  the  mid-term  aggregate 
model  is  disaggregated  into  separate  sub-models  by  installation  and  training  program 
leading  to  near-term  disaggregate  analyses. 

The  HUIP  scheduling  methods  of  mid-term  aggregate  scheduling  are  also  used  for 
near-term  disaggregate  scheduling.  By  solving  each  of  the  disaggregated  scheduling 
problems,  it  is  possible  to  identify  training  resource  shortfalls  by  installation  and  training 
week.  Unfortunately,  problem  disaggregation  causes  the  solution  space  to  explode  which 
necessitates  reducing  the  size  of  the  problem.  Accordingly,  the  plarming  horizon  for  the 
near-term  disaggregate  problem  is  shortened  to  one  year.  Infeasible  scenarios  from  long-, 
mid-  and  near-term  analysis  may  be  iteratively  revised  until  they  become  feasible  either 
by  fixing  the  recruiting  (i.e.,  training)  objective  and  iteratively  increasing  training 
resource  levels  or,  alternatively,  fixing  the  training  structure  (training  companies)  and 
iteratively  decreasing  the  recruiting  objectives  (that  drive  the  training  requirement)  until  a 
feasible  scenario  is  found.  These  approaches  are  also  followed  in  ARMY.  However, 
when  recruiting  objectives  are  fixed,  training  company  shortfalls  (representing  an 
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additional  number  of  training  companies  required  to  meet  the  training  objective)  are 
automatically  determined  by  the  fully  automated  SPH  and  MPH  scheduling  procedures. 

The  1989  DSS  generates  numerical  and  graphical  output  that  is  consistent  in  form 
and  content  with  information  used  at  the  time  to  support  training  resource  scheduling 
scenario  analysis.  The  1989  DSS  partially  automated  some  aspects  of  scheduling  with 
heuristics-used-in-practice  to  substantially  reduce  the  time  to  obtain  a  schedule  from 
several  days  (or  at  best  several  hours)  using  manual  methods,  to  approximately  thirty 
minutes  (implemented  on  a  286  microcomputer).  An  improved  version  of  the  1989  DSS 
is  still  in  use  at  TRADOC  Headquarters  today. 

However,  several  severe  drawbacks  exist  with  both  the  original  and  improved 
versions  of  the  1989  DSS.  First,  resource  scheduling  remains  a  manual  trial-and-error 
process  which  may  lead  to  the  incorrect  classification  of  a  feasible  training  base  scenario 
as  infeasible,  thus  eliminating  a  viable  training  scenario  from  further  consideration. 
Second,  the  trial-and-error  scheduling  approach  of  HUIP  can  result  in  different 
scheduling  solutions  for  the  same  training  base  scenario.  Scheduling  consistency  and 
quality  of  results  when  scheduling  with  HUIP  are  highly  dependent  upon  the  skill  and 
experience  of  the  analyst.  Finally,  the  system  does  not  incorporate  performance  measures 
that  support  comparative  analyses  for  appraising  the  quality  of  competing  feasible 
training  schedules. 

5.2  System  Development  Process 

A  major  effort  has  gone  into  designing,  testing,  and  implementing  a  fully 
operational  Decision  Support  System  for  Army  Basic  Combat  Training  Resource 
Management  per  Year  to  eliminate  the  shortcomings  identified  above,  and  to  improve 
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support  for  decision  making  at  the  major  levels  of  basic  training  management  discussed 
previously.  ARMY  development  followed  four  sequential,  overlapping  tasks. 

Task  1 .  Functional  Description  of  the  ARA4Y  System; 

Task  2.  Preliminary  Design  of  the  Architecture  and  System  Modules; 

Task  3 .  Development  of  the  ARMY  System  Prototype; 

Task  4.  Full  Development  of  the  ARMY  System. 

The  ARMY  development  process  is  shown  in  Figure  10,  along  with  the  subtasks 
accomplished  for  each  major  task. 

The  system  development  steps  follow  the  decision  analysis  process  for  solving 
basic  training  problems  to  the  extent  possible.  These  steps  include  structuring  the  basic 
training  problem,  generating  resource  schedules  for  analyzing  training  scenarios,  and 
representing  system  output  in  formats  that  support  the  decision  process. 

The  main  objective  of  Task  1  was  to  identify  the  primary  functions  of  the  ARMY 
system  in  terms  of  how  the  system  could  best  support  the  decision  process.  In  Task  2, 
system  architecture  was  graphically  represented  through  a  set  of  interconnected  modules 
where  each  module  corresponds  to  a  functional  requirement  of  the  system  (see  Figure 
11).  Module  relationships  were  defined  according  to  four  attributes: 

1 .  module  input  and  output; 

2.  module  functions; 

3.  functional  procedures  that  define  the  logic  and  rules  by  which  each 
module  operates;  and 

4.  flow  of  data  between  modules. 


I 
I 
j 
i 

In  Task  3,  prototypes  of  each  module  were  implemented  within  a  common 
computer  operating  system  (see  below),  and  procedures  were  developed  to  control  the 
flow  of  data  between  modules.  The  final  step  of  Task  3  was  prototype  testing.  In  Task  4, 
the  modules  were  linked  to  form  the  complete  system.  Task  4  concluded  with  full  system 
testing. 


Figure  10.  Decision  Support  System  Development  Phases 


81 


5.3  System  Architecture 

The  ARMY  system  is  based  on  a  modular  design  and  implemented  in  a  computer 
spreadsheet  environment  called  LOTUS  1-2-3  for  Windows,  Release  4  that  provides  a 
fully  integrated  environment  for  model  development.  In  the  LOTUS  1-2-3  spreadsheet, 
the  system  modules  are  dynamically  linked  (see  Figure  11)  to  enable  dynamic  data 
exchange  (DDE).  The  LOTUS  1-2-3  spreadsheet  software  is  ideally  suited  for  handling 
thousands  of  calculations  required  to  generate  a  weekly  training  resource  schedule. 
Advanced  macros  provide  programming  flexibility  for  implementing  and  fully 
automating  the  scheduling  and  policy  improvement  routines,  and  for  streamlining  routine 
scheduling  calculations.  LOTUS  1-2-3  also  features  a  set  of  built-in  statistical  functions 
useful  for  analyzing  resource  scheduling  output  to  support  decision  analysis.  The  ARMY 
system  is  centered  around  the  dynamic  system  model  of  the  Army's  Basic  Training 
Program  presented  in  Chapter  2.  The  modular  system  design  facilitates  tailoring  the 
ARMY  to  other  military  training  programs  as  discussed  previously.  ARMY  architecture 
and  system  modules  are  shown  in  Figure  11.  The  descriptive  module  names  indicate  the 
primary  functionality  of  each  module. 

The  Resource  Scheduler  Module  forecasts  the  number  of  recruits  to  arrive  for 
training  each  period  and  then  computes  the  number  of  training  companies  required  to 
start  training  each  week.  The  module  also  schedules  basic  training  companies  (or  other 
training  resources)  in  each  period  of  the  planning  horizon  based  on  user-specified  initial 
conditions  and  assumptions  (see  Sections  2.3  and  4.2). 

The  Policy  Improvement  Module  invokes  the  single-pass  heuristic  to  generate  an 
initial  feasible  training  resource  schedule  (if  one  exists)  that  is  sequentially  improved  via 
the  multi-pass  heuristic.  The  logical  flow  of  resource  scheduling  and  heuristic  policy 
improvement  is  diagrammed  in  Figures  5  through  9  of  Chapter  4.  Both  the  single-  and 


multi-pass  heuristic  scheduling  algorithms  are  completely  automated,  thus  eliminating  the 
need  for  an  analyst  to  interact  with  the  computer  model  as  is  the  case  when  scheduling 
with  the  heuristics-used-in-practice  of  the  1989  DSS. 
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Figure  1 1 .  Decision  Support  System  Architecture  and  Modules 


The  Numerical  Analysis  Module  summarizes  scheduling  information  and 
computes  scheduling  statistics.  The  Graphical  Analysis  Module  provides  graphical 
output  of  scheduling  information  and  scheduling  statistics.  The  Report  Generation 
Module  prints  numerical  and  graphical  results  in  formats  tailored  to  support  the  decision 
process.  All  graphs  may  be  displayed  to  the  computer  screen  or  printed,  either  from  the 
keyboard  or  by  "clicking"  on  user-friendly  "buttons"  using  a  mouse  pad.  Results  in 
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Chapter  6  gives  examples  of  numerical  and  graphical  output.  Output  from  an  illustrative 
scheduling  session  with  the  system  is  found  in  Appendix  C. 

5.4  Resource  Costing  Module 

The  Resource  Costing  Module  estimates  various  training  program  costs  from  the 
training  resource  schedules  generated  via  the  single-  and  multi-pass  heuristics  of  the 
ARMY  system.  The  idea  of  using  training  resource  schedules  to  generate  training 
program  cost  estimates  was  provided  by  Rao  [11]  who  developed  a  manpower  plarming 
model  analogous  to  Wagner  and  Whitin's  ELSP  model.  Rao’s  model  uses  manpower 
requirements  for  future  periods  to  minimize  manpower  system  costs  for  a  number  of 
fixed  and  variable  recruitment  costs. 

Training  program  resource  costs  are  incorporated  into  ARMY  to  make  the  DSS 
more  supportive  of  training  program  decision  making.  Cost  measures  currently  estimated 
by  ARMY  include  the  following. 

•  Total  annual  costs  for  the  two-year  planning  horizon  including  fixed  costs, 
variable  costs,  costs  by  resource,  and  total  program  cost. 

•  Year-by-year  cost  differences  and  percent  yearly  changes  in  training  costs 
reflecting  training  installation  changes  (attributable  to  training  base  scenarios) 
or  force  structure  changes  (e.g.,  recruiting  targets).  Yearly  cost  differences  are 
computed  for  each  type  of  cost  given  above. 

•  Average  variable  cost  are  determined  per  training  cycle  for  the  by  training 
program,  by  training  battalion,  by  training  company,  and  by  trainee. 

•  Average  variable  costs  are  computed  per  training  company  (per  training  cycle) 
by  resource. 
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•  Average  program  cost  per  recruit,  and  average  program  cost  per  recruit  by 
training  resource  are  also  estimated. 

Typical  cost  data  for  this  study  was  provided  by  the  Directorate  of  Resomce 
Management^  (DRM)  of  Fort  Bemiing,  Georgia.  The  data  is  based  on  a  1993  study  of 
basic  training  resource  utilization  conducted  by  the  Fort  Benning  DRM.  The  1993  Fort 
Benning  Study  measured  fourteen  training  resource  costs  for  a  single  basic  training 
battalion  consisting  of  one  headquarters  company  and  five  basic  training  companies 
during  six  consecutive  training  cycles.  Ten  representative  cost  items  from  the  Fort 
Benning  study  were  selected  to  illustrate  the  costing  methodology.  These  cost  items  are: 
(1)  in-processing  activities,  (2)  basic  training  support,  (3)  supply  operations,  (4) 
maintenance  of  basic  training  equipment  and  materials,  (5)  transportation  services,  (6) 
laundry  services,  (7)  food  services,  (8)  personnel  support,  (9)  ammunition  for  weapons 
qualification,  and  (10)  utilities.  The  four  cost  items  not  used  are  housing  costs  for  linen 
replacement,  engineer  support  for  refuse  collection,  copying  and  printing  costs  for  records 
management,  and  manpower  costs  for  maintenance  of  real  property. 

Training  costs  are  classified  (for  this  study)  according  to  four  categories;  direct- 
fixed,  direct-variable,  indirect-fixed,  and  indirect-variable.  Direct  costs  account  for 
expenditures  directly  related  to  recruit  in-processing  and  basic  training.  Indirect  costs 
represent  the  proportion  of  base  operations  costs  attributable  to  initial  entry  training.  As 
shown  in  Figure  7,  base  operations  costs  may  be  further  broken  down  into  two  categories: 
base  support  services  and  facility  support  services.  Appendix  D  gives  a  partial  listing  of 
base  operations  costs  by  category  (for  reference),  however,  it  is  important  to  note  that 
indirect  training  costs  are  likely  to  vary  across  training  installations,  and  in  some  cases. 


^Courtesy  of  Major  Scott  Manderville,  then  of  the  Directorate  of  Operations  and  Training  (DOT),  Fort 
Benning,  GA. 
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may  not  apply  at  all.  Only  one  indirect  cost,  utilities,  was  reported  in  the  1993  Fort 
Benning  Study.  Numerous  attempts  were  made  by  the  author  to  obtain  additional  indirect 
costs  from  other  training  installations  and  TRADOC  Headquarters  as  well,  however,  none 
of  the  agencies  were  able  to  provide  actual  (or  estimated)  indirect  costs  for  the  initial 
entry  training  program. 


The  1993  Fort  Benning  Study  reports  summarized  annual  fixed  and  variable  costs 
for  the  resources  listed  above.  Therefore,  in  order  to  estimate  resource  costs  for  training 
resource  schedules  from  the  ARMY  system,  it  was  necessary  to  break  down  the  annual 
fixed  and  variable  costs  into  cost  factors.  Training  resource  cost  factors  for  each  cost 
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category  are  given  in  Table  1.  Details  of  cost  factor  computations  are  provided  in 
Appendix  E. 


Cost  Factors 

$/bn/yr 

$/bn/start 

S/co/yr 

$/co/start 

$/recruit 

Personnel  In-Processing 

$27,000 

$85 

Basic  Training  Support 

$83,000 

$2166 

$345 

Supply  Operations 

$61,000 

Maintenance  of  Materials 

$68,000 

$290 

Transportation  Services 

$2500 

$56,000 

$1833 

Laundry  Services 

$145 

Food  Services 

$788 

Personnel  Support 

$26,000 

Ammunition 

$131 

$388 

Utilities: 

$55,500 

GRAND  TOTAL: 

$110,000 

$58,000 

$211,000 

$4130 

$2041 

Table  1 .  Training  Cost  Factors  for  Basic  Training  Analysis 


Fixed  costs  are  generally  assessed  at  the  beginning  of  each  year  (YR)  of  the 
planning  horizon  based  on  either  the  number  of  training  battalions  or  the  number  of 
training  companies,  available  at  the  begiiming  of  each  year.  Both  fixed  and  variable  costs 
are  computed  by  training  battalion  (BN)  and  by  training  company  (CO).  A  variable  cost 
per  recruit,  by  resource,  is  also  determined. 

It  is  important  to  note  that  the  cost  factors  of  Table  1  are  based  entirely  on  Fort 
Benning  training  costs,  and  do  not  reflect  regional  or  seasonal  cost  differences  known  to 
exist  from  installation-to-installation.  Therefore,  the  aggregated  experimental  results  of 
Chapter  6  are  only  meant  to  illustrate  how  cost  factors  can  be  used  to  estimate  basic 
training  resource  costs  from  the  training  resource  schedules  generated  via  the  heuristic 
procedures  of  Chapter  4. 
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Recruit  In-Processing 

Recruit  in-processing  costs  include  costs  associated  with  in-processing  new 
recruits  during  the  fill  week.  The  1993  Fort  Benning  Study  include  the  annual  salary  for 
one  civilian  employee  responsible  for  soldier  in-processing  activities,  and  the  cost  of 
general  supplies  and  organizational  clothing  issued  to  recruits. 

Basic  Training  Support 

Basic  training  support  activities  include  (1)  the  fixed  cost  of  annual  salaries  for 
two  civilian  personnel  providing  administrative  support  to  training  companies,  and  (2) 
variable  costs  for  soldier  issue  items,  company  issue  items,  issue  of  general  supplies, 
cleaning  kits  for  individual  and  crew-served  weapons,  load  bearing  equipment  (LBE), 
and  issue  of  medical  supplies. 

Supply  Operations 

Supply  operations  costs  include  three  fixed  costs  per  training  battalion  for  the 
annual  salaries  of  one  sewing  machine  operator  to  alter  clothing  issued  to  new  recruits, 
one  tailor  to  measure  recruits  for  clothing  issue,  and  one  clerk  to  transcribe  data  and 
process  trainee  records. 

Maintenance  of  Materials 

Maintenance  costs  from  the  1993  Fort  Benning  Study  include  annual  salaries  for 
maintenance  workers,  annual  maintenance  costs  for  direct  support  (DS)  and  general 
support  (GS)  maintenance  of  weapons  and  load  bearing  equipment  (LBE),  and  the  annual 
cost  of  repair  parts  for  weapons  and  LBE. 
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Transportation  Services 

Transportation  costs  pertain  to  both  administrative  and  training  company 
transportation  requirements.  The  1993  Fort  Benning  Study  reports  annual  transportation 
costs  by  type  of  vehicle  computed  as  the  operating  cost  per  mile,  times  the  average  miles 
operated  per  training  cycle,  times  the  number  of  vehicles  operated  (by  type).  The  cost 
module  calculates  an  administrative  transportation  cost  per  training  battalion  for  each  set 
of  five  basic  training  companies  that  start  a  training  cycle,  along  with  training  company 
transportation  costs  per  training  cycle.  An  annual  cost  for  drivers  to  support  training 
companies  is  also  included. 

Laundry  Services 

Weekly  laundry  service  is  provided  to  recruits  during  initial  entry  training.  The 
Fort  Benning  Study  gives  an  annual  cost  for  laundry  service  provided  to  recruits  based  on 
the  six  basic  training  cycles  assuming  200  recruits  per  training  company.  The  laundry 
cost  factor  represents  the  laundry  cost  per  trainee  per  training  cycle.  The  cost  of 
operating  and  maintaining  the  laundry  facility  is  not  included. 

Food  Services 

The  1993  Fort  Benning  Study  reports  an  estimated  annual  cost  for  operating  the 
battalion  dining  facility.  For  the  cost  module,  the  annual  food  service  cost  is  broken 
down  using  the  DRM's  assumptions  of  200  recruits  per  company  and  six  basic  training 
cycles  per  year  to  obtain  a  food  service  cost  per  recruit. 
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Personnel  Support 

Personnel  support  includes  the  annual  salary  paid  to  one  civilian  employee  per 
training  company  responsible  for  providing  administrative  support  to  each  basic  training 
company. 

Ammunition 

Two  types  of  ammunition  are  included  in  the  study:  (1)  ammunition  for  the 
individual  assault  weapon  (Ml 6);  and  (2)  ammunition  for  the  squad  assault  weapon 
(SAW).  Ammunition  cost  factors  are  computed  for  a  fixed  amount  of  ammunition  (called 
the  basic  load)  issued  to  training  company  cadre  for  instruction  purposes,  and  a  variable 
ammunition  cost  per  recruit  for  weapons  training  and  qualification. 

Utilities 

Utilities  is  the  only  indirect  cost  reported  in  the  1993  Fort  Benning  Study. 
Utilities  includes  annual  costs  for  electricity,  natural  gas,  water  services,  and  sewer 
services  for  the  training  battalion.  The  utilities  cost  reported  are  average  cost  based  on 
several  years  of  historical  data. 

5.5  Benefits  of  the  DSS 

ARMY  is  a  user-friendly  computer  software  package  that  can  support  analysis  of 
many  practical  problems  encountered  in  basic  training  management.  Experiments  with 
ARMYYdiVQ  demonstrated  its  capability  to  evaluate  different  scenarios  for  downsizing  or 
realigning  basic  training  installations,  changes  to  recruitment  objectives,  and  scenarios 
that  build-up  the  training  base  structure  as  well.  ARMY  can  be  used  to  support  the 
decision  making  process  by  generating  practical  resource  scheduling  performance 
measures  that  help  decision  makers  select  a  "best"  alternative  from  a  set  of  feasible  ones. 
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Finally,  ARMY  quickly  generates  realistic  training  resource  schedules  that  include 
training  resource  cost  estimates  based  on  resource  utilization. 
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6.  RESULTS 


The  size  of  the  real-world  basic  training  problem  precludes  implementation  of  an 
exact  solution  method  (see  Sections  3.1  and  3.4),  which  could  be  used  as  a  yardstick  to 
measure  the  effectiveness  of  the  heuristic  scheduler.  However,  modifications  to 
assumptions  and  constraints  given  in  Chapter  2  that  further  constrain  the  subsets  of 
feasible  training  resource  decision  elements  (e.g.,  admissible  actions  for  company 

strengths  and  training  cycles)  given  by  Xj  t,  /^.(t)jcQ  and  Of  t,  Ij(t)^cz  A  (see 


Chapter  3),  make  it  possible  to  implement  the  dynamic  programming  technique  for  a 
much  simplified  version  of  the  basic  training  problem.  Section  6.2  compares,  for  twelve 
example  problems,  optimal  scheduling  solutions  using  DP  with  results  from  the  heuristic 
scheduling  procedures  of  Chapter  4. 

First,  however,  we  follow  an  approach  similar  to  that  used  by  Yang  and  Ignizio 
[13]  to  evaluate  their  heuristic  method  for  solving  the  battalion  training  problem 
discussed  in  Chapter  2,  where  an  optimal  solution  to  their  very  large  real-world  problem 
could  not  be  obtained.  Yang  and  Ignizio  evaluated  their  heuristic  method  against  two 
other  heuristics  for  three  different  scheduling  problems.  Results  were  compared  using 
two  performance  measures:  (1)  makespan  of  the  schedule,  and  (2)  CPU  (central 
processing  unit)  time  to  generate  the  schedule. 

We  compare  the  three  heuristic  scheduling  methods  of  Chapter  4 


•  Heuristics-Used-In-Practice  (HUIP); 

•  Single-Pass  Heuristic  (SPH); 

•  Multi-Pass  Heuristic  (MPH). 


using  foiu*  performance  measures  for  twelve  realistic  test  scenarios. 
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6.1  Heuristic  Scheduling  Results 

The  test  scenarios  for  this  study  are  based  on  1988  training  base  structure  and 
illustrative  reeruiting  targets  for  1989  and  1990.  The  number  of  training  companies 

deactivated  (tk)  by  year  j  and  scenario  are  shown  in  Table  2. 


Scenario: 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

D 

-^1989  • 

5 

10 

15 

20 

0 

0 

0 

0 

5 

10 

15 

10 

B 

^1990  • 

0 

0 

0 

0 

5 

10 

20 

30 

5 

10 

15 

10 

Table  2.  Training  Company  Deactivation  Scenarios 


The  annual  recruiting  targets  for  this  study  are  ^1939  =  136,000  and 
=  128,000.  Initially,  130  training  companies  are  available  for  training  recruits  at 

the  beginning  of  1989  (year  1).  Training  cycle  lengths  are  initialized  at  ten  weeks.  For 
all  scenarios,  except  10  and  12,  training  company  strengths  Xj(t)  are  initialized  at  200 

recruits  per  training  company  for  the  HUIP  method  and  150  recruits  for  the  SPH  and 
MPH  methods.  For  Scenarios  10  and  12,  all  three  heuristics  are  initialized  at  150  and  200 
recruits  per  company,  respectively.  Initial  conditions  for  company  strengths,  by 
scheduling  method,  are  summarized  in  Table  3. 


Method  /  Scenario 

1-9, 11 

10 

12 

HUIP: 

200 

150 

200 

SPH: 

150 

150 

200 

MPH: 

150 

150 

200 

Table  3.  Initial  Company  Strength  Values 


Experimentation  with  the  SPH  and  MPH  methods  reveals  that  processing  time 
and  schedule  quality  (as  measured  by  the  instructor-to-student  ratio)  are  highly  dependent 
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upon  the  step  size  n  of  the  Company  Strength  Policy  Improvement  Algoritlim  (see 
Section  4.2).  A  step  size  of  n  =  5  was  chosen  for  this  study  based  on  experiments  with 
step  size  increments  from  one  to  ten,  to  evaluate  tradeoffs  between  processing  time  and 
schedule  quality.  Other  step  sizes  can  be  easily  accommodated  in  the  model. 


Performance  Measures 

The  "goodness"  of  any  training  resource  scheduler  may  be  evaluated  according  to 
different  performance  measures,  such  as  (1)  the  "quality"  of  the  training  schedule 
obtained  (e.g.,  the  instructor-to-student  ratio),  (2)  training  program  cost,  (3)  training 
resource  utilization  as  measured  by  average  (over  the  planning  horizon  or  by  year)  of  idle 
training  companies,  training  company  strength,  or  the  number  of  training  companies  to 
begin  a  training  cycle  each  period,  and  (4)  CPU  time.  For  this  study,  training  resource 
schedules  and  scheduling  methods  are  evaluated  according  to  following  four  performance 
measures: 


1 .  processing  time  -  as  the  time  needed  to  obtain  a  feasible  training  resource 
schedule; 


V  I 

2.  the  training  "quality"  performance  measure,  ^  - -  (instructor-to-student 

t=l 

ratio),  where  we  assume,  for  simplicity,  there  is  only  one  instructor  per 
training  company; 


3. 


the  average  number  of  idle  training  companies  per  period. 


1 


V  t=l 


measure  of  training  resource  utilization;  and 
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4.  basic  training  resource  costs  -  computed  using  the  cost  factors  described  in 
Section  5.4. 

Scheduling  Results 

All  computational  tests  were  conducted  by  the  author  on  a  486/D66 
mierocomputer.  The  user  of  the  computer-supported  HUIP  method  must  interact  with  the 
computer  model  to  manually  determine  the  company  strength  and  cycle  length  decisions 
to  take  in  each  period.  These  decisions  are  determined  automatically  yia  the  SPH  and 
MPH  methods. 

Only  training  base  scenarios  where  a  feasible  training  resource  schedule  exists 
were  considered  to  ensure  a  yiable  comparison  across  test  scenarios  and  scheduling 
methods.  It  is  worth  noting  that  the  author's  first  attempt  to  find  a  feasible  training 
resource  schedule  for  Scenario  8  was  unsuccessful  using  HUIP.  If  a  feasible  training 
resource  schedule  for  this  scenario  was  not  obtained  using  the  SPH  method,  Scenario  8 
might  haye  been  incorrectly  classified  as  infeasible. 

Table  4  (see  below)  giyes  numerical  results  by  training  base  scenario  (Seen)  and 
scheduling  method  {HUIP,  SPH,  MPH)  for  performance  measures  1,  2  and  3.  Results  for 
training  program  costs  are  giyen  separately  in  the  succeeding  subseetion  Comparing 
Training  Program  Costs.  Columns  Dj  and  of  Table  4  correspond  to  rows  and 

of  Table  1,  respectiyely;  each  entry  indicates  the  number  of  training  companies  to 
deactiyate  each  year.  The  columns  Time,  Ratio  and  Idle  Co  correspond  to  performance 
measures  1,  2  and  3,  respectiyely. 

Processing  Time  of  the  heuristic,  measured  manually  with  a  stop  watch,  represents 
the  time  in  minutes  and  seconds  required  to  generate  a  training  resource  schedule.  MPH 
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processing  time  represents  time  required  for  the  MPH  to  improve  the  initial  feasible 
training  resource  schedule  of  the  SPH. 

Ratio  values  in  Table  4  represent  the  instructor-to-student  ratios  summed  over  the 
plamhng  horizon  Tj.  The  utopian  value  for  this  performanee  measure  is 

96  1 

-  =  0.64;  so  named  because  it  will  not  be  attainable,  in  general. 

^  150 

Idle  Co  represents  the  number  of  idle  training  eompanies  averaged  over  the 
plamiing  horizon  (rounded  to  the  nearest  whole  training  company). 


HUIP 

SPH 

mph' 

Seen 

D 

1 

D 

2 

Time 

Ratio 

Idle  Co 

Time 

Ratio 

Idle  Co 

Time 

Ratio 

Idle  Co 

1 

5 

0 

2:57 

.478 

34 

:53 

.581 

8 

:36 

.582 

8 

2 

10 

0 

3:53 

.476 

30 

:58 

.568 

6 

:38 

.569 

6 

3 

15 

0 

5:53 

.473 

25 

1:01 

.554 

5 

:38 

.556 

5 

4 

20 

0 

9:52 

.469 

21 

1:06 

.540 

4 

:47 

.542 

4 

5 

0 

5 

3:03 

.480 

36 

:51 

.585 

9 

:36 

.587 

9 

6 

0 

10 

3:21 

.478 

34 

:53 

.577 

8 

:39 

.578 

8 

B 

0 

20 

3:23 

.471 

31 

:57 

.559 

7 

:44 

.561 

7 

0 

30 

9:57 

.464 

28 

1:05 

.542 

7 

:48 

.544 

7 

9 

5 

5 

5:09 

All 

32 

;56 

.572 

7 

:42 

.574 

7 

10 

10 

10 

10:27 

.515 

8 

1:05 

.549 

6 

:49 

.551 

6 

11 

15 

15 

11:48 

.465 

20 

1:13 

.527 

4 

:55 

.529 

4 

12 

10 

10 

4:51 

.472 

25 

:41 

.470 

22 

11:54 

.549 

7 

Table  4.  Numerical  Results  of  Scheduling  Heuristies 


^Processing  time  for  the  MPH;  does  not  include  processing  time  to  obtain  an  initial  feasible  resource 
'schedule  needed  as  the  starting  point  for  the  MPH  (see  Section  4.3).  For  this  study,  the  initial  schedule  for 
the  MPH  was  generated  using  the  SPH. 
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Comparing  Processing  Times 

Figure  13  graphs  processing  time  for  the  tlu'ee  heuristics.  Observe  that  processing 
times  increase  as  Dj  increases.  However,  the  increase  in  HUIP  processing  time  by 

scenario  is  much  more  dramatic.  Results  for  Scenarios  1  through  11  show  that  the  SPH 
finds  schedules  6.4  times  faster  than  the  HUIP  method,  and  1 .7  times  faster  than  the  MPH 
if  the  time  to  find  an  initial  feasible  schedule  (via  SPH)  is  added  to  MPH  processing  time. 
Similarly,  the  MPH  is  3.7  times  faster  than  the  HUIP  method. 


□  MPH  DSPH  BHUIP 


Processing 

Time 

(min) 


1  2  3  4  5  6  7  8  9  „ph 


Scenario 


Figure  13.  Comparison  of  Heuristic  Processing  Times 

For  Scenario  12,  company  strengths  are  initialized  at  200  recruits  per  training  company 
which  dramatically  reduces  SPH  processing  time,  but  results  in  a  significantly  "poorer" 
initial  feasible  schedule  for  the  SPH  (see  Table  4)  which  negatively  affects  MPH 
processing  time  performance,  since  the  MPH  uses  the  SPH  schedule  as  its  starting  point 
(see  Section  4.3).  Consequently,  many  more  iterations  of  the  MPH's  Company  Strength 
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Policy  Improvement  Algorithm  are  necessary  to  "tighten"  the  solution  in  eaeh  period 
causing  MPH  processing  time  to  increase. 

Comparing  the  Instructor-to-Student  Ratios 

As  discussed  previously,  the  instructor-to-student  ratio  serves  as  a  performance 
measure  of  training  quality  (see  Chapter  2).  Assuming  one  instructor  per  training 
company,  the  ratio  is  the  inverse  of  company  strength  decisions  in  each  period. 
Therefore,  the  problem  is  suboptimized  by  making  company  strengths  as  small  as 
possible.  Figure  14  compares  instructor-to-student  ratios  for  each  heuristic  procedure, 
where  the  utopian  value  of  the  performance  measure  is  0.64. 


QHUIP  DSPH  BMPH 


0.64 


Scenario 


Figure  14.  Comparisonoflnstructor-to-Student  Ratios 


On  average,  the  quality  of  the  MPH  and  SPH  schedules  were  19%  and  18.7% 
better  than  the  HUIP  schedules,  respectively.  The  SPH  and  MPH  generate  higher  quality 
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schedules  for  all  cases  except  Scenario  12,  where  the  HUIP  marginally  outperforms  the 
SPH.  The  reason  for  this  is  that  the  SPH  attempts  to  find  a  feasible  solution  by  increasing 
training  company  strengths  one  step  at  a  time  from  the  current  company  strength  value, 
but  only  in  those  weeks  where  there  is  a  training  company  shortfall  (fy(0  <o).  In 

Scenario  12,  initializing  company  strengths  at  200  recruits  eliminates  a  number  of 
training  company  shortfalls  that  would  have  otherwise  occurred  had  company  strengths 
been  initialized  at  150  recruits  per  company.  Thus,  the  initial  condition  imposes  a 
"penalty"  on  the  quality  of  the  solution  that  can  be  attained  via  the  SPH  for  Scenario  12. 
However,  Scenario  12  also  illustrates  the  how  the  MPH  methodically  tightens  the 
company  strengths  in  each  period  of  the  planning  horizon,  thereby  improving  the 
objective  function  value,  from  0.470  for  the  SPH  to  0.549  for  the  MPH;  a  substantial 
16.8%  improvement  in  quality.  For  Scenarios  1  through  11,  the  MPH  gives  an  average 
improvement  over  the  SPH  solutions  of  only  three  tenths  of  one  percent  (0.3%),  but  as 
shown  in  Figure  13,  the  processing  time  for  the  MPH  is  so  small  (in  general)  that  the 
small  improvement  in  the  "quality"  measure  may  be  worth  getting.  Even  though  this 
modest  gain  may  seem  inconsequential,  it  may  be  quite  significant  in  terms  of  total 
dollars  when  applied  to  annual  training  program  costs. 

In  Figure  15  (see  below),  the  scheduling  results  are  measured  against  the  utopian 
value  for  scheduling  performance.  The  scheduling  results  are  expressed  as  percent  of 
utopian  value,  here  100  percent  represents  an  instructor-to-student  ratio  of  0.64.  From 
Figure  15,  it  is  evident  that  the  "effectiveness"  of  the  result  depends,  in  part,  upon  the 
number  of  training  companies  deactivated  in  the  corresponding  scenario. 
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Figure  15.  Effectiveness  of  Heuristic  Solutions  as  a  Percent  of  the  Utopian 


The  various  scenarios  selected  for  this  study  represent  a  "balanced"  set  of 
problems  (that  is,  balanced  according  to  the  distribution  of  the  severity  of  training 
company  cuts)  for  testing  the  scheduling  procedures.  The  problems  represent  a  range  of 
reasonable  conditions  so  as  to  not  (overly)  bias  the  estimates  of  scheduling  effectiveness. 
Excluding  the  special  cases.  Scenarios  10  and  12,  the  HUIP,  SPH,  and  MPH  methods 
generate  solutions  that  are  (on  average)  approximately  74%,  87.5%,  and  87.8%  of  the 
utopian  value  of  the  "quality"  performance  measure,  respectively.  It  is  believed  that  if 
the  SPH  and  MPH  methods  are  used  (or  tested)  under  similar  conditions  in  the  future, 
then  the  average  performance  of  the  SPH  and  MPH  methods,  under  these  conditions,  will 
be  approximately  87%  of  the  utopian  solution;  substantially  better  than  the  estimated 


74%  for  the  HUIP  method. 
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Comparing  the  Average  Number  of  Idle  Training  Companies 

The  third  perfomiance  measure,  average  number  of  idle  training  companies, 
evaluates  training  resource  schedules  in  terms  of  training  company  utilization.  Note  that 
the  objective  of  finding  the  maximum  instructor-to-student  ratio  (by  making  company 
strengths  as  small  as  possible  in  each  period),  coincidentally,  drives  idle  training 
companies  (or  other  idle  economic  resources)  to  minimal  levels.  ARMY  supports  basic 
training  decision  making  by  generating  useful  experimental  data,  such  as  the  average 
number  of  idle  training  companies,  that  helps  to  detennine  whether  or  not  the  training 
base  is  either  over-  or  under-structured  relative  to  projected  recruiting  targets.  Figure  16 
compares  idle  training  companies  for  the  various  training  base  scenarios. 


Figure  16.  Comparison  of  Idle  Training  Companies 


Figure  16  reveals  that  for  Scenarios  1  through  8,  the  average  number  of  idle 
training  companies  declines  with  training  company  deactivations  (as  expected):  when 
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fewer  companies  are  available  to  train  recruits,  then  a  higher  proportion  of  the  companies 
still  available  must  be  pressed  into  use. 

Next,  observe  that  for  all  scenarios  except  12,  idle  training  companies  for  the  SPH 
and  MPH  are  exactly  equal.  This  is  because  the  SPH  Company  Strength  Policy 
Improvement  Algorithm  is  designed  to  satisfy  the  scheduling  feasibility  constraint 

{lj(t)  ^  o)  by  increasing  company  strengths  in  those  weeks  where  Ij(t)  <  0  in  steps  of  n 
until  lj(t)  =  0.  The  instant  when  the  feasibility  constraint  becomes  tight  (i.e.,  when 

equality  holds),  the  algorithm  stops.  Therefore,  although  the  MPH  is  able  to  tighten 
company  strengths  in  each  period  when  company  strengths  are  initialized  at  the  lower 
bound,  as  with  the  SPH,  the  MPH  cannot  further  reduce  idle  training  companies  because 
the  SPH  makes  these  constraints  tight  in  each  period.  However,  when  training  companies 
are  initialized  at  a  value  greater  than  the  lower  bound,  such  as  at  200  recruits  as  in 
Scenario  12,  then  the  condition  Ij(t)  >  0  may  occur  (in  many  periods).  Then  MPH's 

refinement  of  the  company  strengths  leads  to  a  reduction  of  idle  training  companies  as 
well  (see  results  for  Scenario  12).  Differences  in  idle  companies  between  the  SPH/MPH 
and  the  HUIP  methods  (excluding  Scenario  12)  shows  that  the  SPH/MPH  are  3.8  times 
more  effective,  on  average,  at  reducing  the  number  of  idle  training  companies  than  the 
HUIP  method. 

Comparing  Training  Costs 

The  training  costs  presented  here  are  based  on  the  ten  cost  items  listed  in  Section 
5.4.  These  analyses  illustrate  how  cost  factors  might  be  used  to  estimate  training  resource 
costs  for  schedules  produced  via  the  ARMY  system.  Although  these  cost  estimates  are 
only  illustrative,  it  is  believed  that  incorporating  additional  training  program  costs  into 
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the  system  will  lead  to  realistic  training  resource  cost  estimates  that  support  both  long- 
range  plamiing  and  current  operations. 


1989  Costs  for  Scenario  11 

SPH 

HUIP 

Total  Program  Cost 

$286,896,573 

$285,764,013 

Total  Fixed  Cost  (BN  &  CO) 

$26,795,000 

$26,795,000 

Total  Variable  Cost  (BN,  CO,  &  Recruit) 

$260,101,573 

$258,969,013 

Average  Variable  Cost  Per  Training  Cycle 

$384,745 

$432,342 

Average  Program  Cost  Per  Recruit 

$2,344 

$2,335 

1990  Costs  for  Scenario  11 

SPH 

HUIP 

Total  Program  Cost 

$267,794,283 

$267,070,703 

Total  Fixed  Cost  (BN  &  CO) 

$23,300,000 

$23,300,000 

Total  Variable  Cost  (BN,  CO,  &  Recruit) 

$244,494,283 

$243,770,703 

Average  Variable  Cost  Per  Training  Cycle 

$397,135 

$430,349 

Average  Program  Cost  Per  Recruit 

$2,325 

$2,318 

Table  5.  Summary  of  Training  Resource  Costs 


Table  5  summarizes  training  program  costs  for  training  resource  schedules 
obtained  with  HUIP  and  SPH  for  Scenario  11  (see  Appendix  F  for  additional  cost 
estimates). 

The  ARMY  system  includes  options  for  viewing  costs  estimates  graphically. 
Figure  17  compares  the  program  cost  estimates  for  the  schedules  generated  using  the  SPH 
and  HUIP  methods  for  all  of  the  twelve  "1989"  training  base  scenarios. 

These  results  show  that  the  costs  for  the  HUIP  method  are  consistently  lower  than 
the  total  program  costs  of  the  SPH  method.  This  should  be  expected  since  SPH  generates 
higher  "quality"  schedules  than  the  HUIP  procedure  and  higher  quality  training  costs 
more.  This  result  is  due,  primarily,  to  the  fewer  training  starts  required  by  the  HUIP 
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method  because  of  it  initializes  company  strengths  at  200  recruits.  The  differences  in  the 
way  the  SPH  and  HUIP  schedules  are  obtained  may  also  significantly  impact  training 
program  execution,  training  program  funding,  and  training  resource  allocation. 


Figure  17.  Comparison  of  Total  Training  Program  Cost  by  Scenario 


6.2  Comparison  of  Optimal  and  Heuristic  Scheduling  Results 

In  this  section,  we  compare  the  SPH-MPH^  and  DP  scheduling  methods  for 
solving  a  simplified  problem  to  observe  the  differences  in  the  quality  of  optimal  versus 
heuristic  solutions.  As  explained  in  Section  3.1,  the  state  space  for  one  stage  of  a  real- 
world  basic  training  problem  may  contain  3jc130x(101)^  (or  approximately  4.27x10^*^) 
possible  states.  The  high  dimensionality  of  the  real-world  problem  makes  it  impractical 
to  implement  an  exact  solution  procedure  such  as  dynamic  programming.  However,  it  is 

notation  denotes  that  the  SPH  is  used  to  generate  the  initial  feasible  training  resource  schedule 
used  as  the  starting  point  for  the  MPH. 
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possible  to  use  dynamic  programming  to  solve  a  simplified  version  of  the  problem  to 
make  a  comparison  of  heuristic  versus  DP  results.  Accordingly,  we  make  the  following 
modifications  in  characterizing  the  problem  scenario: 

•  training  cycle  length  decisions  are  removed  from  consideration  by  fixing  cycle 
lengths  at  yj(t)  =  2  tlmoughout  the  planning  horizon  (i.e.,  the  time  lag  in  the 

dynamics  is  reduced  from  ten  periods  to  one  period); 

•  the  maximum  allowable  number  of  idle  training  companies  is  reduced  from 
130  to  20  (this  bound  was  determined  by  experimentation  with  SPH); 

•  allowable  (feasible)  company  strength  decisions  are  reduced  from  101  (for  a 
step  of  1)  to  21  (corresponding  to  a  step  size  of  5). 

These  simplifications  reduce  the  state  space  (for  one  period)  by  seventeen  orders  of 
magnitude:  from  possible  states  for  the  real-world  problem  to  9,261 

(21x21x21)  possible  states  for  the  current  problem.  The  planning  horizon  is  reduced 
from  96  to  48  weeks^,  thus  establishing  a  new  utopian  upper  bound  of  0.32  for  the 
"quality"  performance  measure.  Although  reducing  the  number  of  stages  does  not  reduce 
the  state  space,  it  does  lower  the  requirement  for  computer  memory  and  reduces  the  CPU 
time  for  generating  an  optimal  solution. 


Implementation  of  the  DP  Algorithm 

The  state  transition  equation  for  a  one-period  lag  problem  is  given  by 


I(t  +  1)  =  lit)  + 


rjt  - 1) 

x(f  - 1) 


xitY 


^Note  the  subscript  j  denoting  the  year  of  the  planning  horizon  can  be  dropped  since  48  weeks  correspond 
to  a  one-year  planning  horizon. 
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where  company  strengths  x(t)  are  restricted  to  the  allowable  set  of  integer-valued 
decisions  defined  in  Section  3.2  as  X[t,  I(t)]  a  Q.  This  may  seem  to  be  a  "trivial" 
problem  that  has  little  (or  no)  practical  value  (to  basic  training  management)  as  far  as 
real-world  decision  making  is  concerned.  However,  implementation  of  the  DP  algoritluii 
for  the  one-period  time  lag  problem  is  an  important  first  step  towards  solving  larger,  more 
complex  problems.  The  steps  necessary  to  implement  the  DP  algorithm  to  solve  a  larger 
problem  are  essentially  the  same  as  those  required  to  solve  the  one-period  lag  problem. 
The  major  differences  between  the  one-period  time  lag  problem  and  those  with  longer 
time  lags  are  (1)  the  exponential  explosion  in  the  size  of  the  problem  (see  Table  6)  that 
comes  with  state  augmentation,  and  (2)  an  increase  in  the  complexity  of  the  reformulated 
problem  due  to  additional  terms  in  the  state  transition  equation.  Nevertheless,  the  initial 
effort  on  implementing  the  exact  DP  approach  has  provided  very  good  insight  into,  and 
an  appreciation  for,  issues  that  will  become  more  important  in  future  efforts  to  solve 
larger  problems.  Such  issues  include  the  practical  aspects  of  managing  computer 
memory  effectively,  writing  efficient  computer  codes  for  DP  algorithms,  and  the  choice 
of  software  and  hardware  for  implementing  and  running  the  programs.  Table  6  shows 
how  quickly  the  size  of  the  state  space  increases  as  time-lag  in  the  dynamics  increases  for 
a  company  strength  step  size  of  hve,  and  increases  of  three  idle  training  companies  per 
period  for  each  incremental  increase  in  time  lag.  Note  that  /  <  48  for  the  ten-period  time 
lag  in  Table  6  compares  favorably  with  the  constraint  /  <  50  for  idle  training  eompanies 
given  previously  in  Section  3.5. 
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Time  Lag  (/) 

x{t  - 1) 

m 

x{t) 

State  Space 

1 

21 

21 

21 

9,261 

2 

21^ 

24 

21 

222,264 

3 

21^ 

27 

21 

5.2x10^ 

4 

21^ 

30 

21 

1.2x10^ 

5 

21^ 

33 

21 

2.8x10^ 

6 

21^ 

36 

21 

6.5x10^° 

7 

2f 

39 

21 

1.5x10^^ 

8 

21^ 

42 

21 

3.3x10^^ 

9 

21^ 

45 

21 

1.5x10^^ 

10 

21^° 

48 

21 

1.7x10^® 

Table  6.  Increase  in  the  Size  of  the  State  Space  for  each  One-Period  Increase 
in  the  Time  Lag  of  the  Dynamic  Model 


DP  implementation  developed  here  was  based  on  a  backward  recursion,  although, 
since  the  problem  is  completely  deterministic,  a  forward  DP  algoritlnn  should  work  just 
as  well  (see  Bertsekas  [3],  pg.  31).  The  planning  horizon  of  48  weeks  starts  in  week  one 
and  ends  in  week  48.  The  augmented  state  for  the  one-period  time  lag  problem  is 
represented  by  x(t  - 1)].  For  each  week  t  of  the  one-period  time  lag  problem,  there 
are  441  (e.g.,  21x21)  objective  function  values,  denoted  J*[t,  I{t),  x{t  - 1)],  that  must  be 
determined  to  complete  the  matrix  of  optimal  objective  function  values,  denoted  by 
J*[r,*,*].  Each  one  of  these  /*[?,*,*]  values  is  determined  by  an  enumeration  of 
objective  function  values  for  each  element  of  the  1x21  array  of  allowable  company 
strengths,  denoted  by  X[x(t)],  and  the  maximum  7[r,*]  value  is  chosen  from  among  the 
twenty  one  cases. 

Objective  function  values  are  computed  recursively  back  in  time  starting  in  period 
48  and  ending  in  period  1.  The  J*[r,  /(r),  Jc(r- 1)]  array  of  optimal  objective  function 
values  for  the  planning  horizon  is  "built  up"  period-by-period  so  that  by  week  1  each 
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value  /*[!,*,*]  of  J*[l,  7(1),  x(0)]  represents  the  optimal  objective  function  value  for 
the  plamiing  horizon  for  initial  conditions  of  7(1)  idle  companies  and  x(0)  company 
strength. 

Two  optimal  objective  function  value  arrays  are  needed  for  each  period;  (1) 
J*[r,  7(r),  x(t-l)]  to  store  the  /*[?,*,*]  values  computed  for  the  current  period  t,  and 
(2)  J*[t  + 1,  7(r  + 1),  x(r)]  containing  the  values  /*[?  +  !,*,*]  computed  from  the 
previous  step  (i.e.,  for  the  next  period  t+1)  that  are  needed  to  compute  /*[?,*,*]  in  period 
t  (for  details  see  step  Week  t  below).  Once  all  the  values  are  computed  for 

the  values  in  J*[r  + 1,*,*]  may  be  discarded  and  replaced  by  the  elements  of 
the  array  that  were  just  computed.  This  last  operation  serves  as  the  preliminary 

step  for  working  backwards  for  one  period  in  the  planning  horizon. 

If  we  assume  a  one-period  "quality"  measure  value  of  zero  for  period  49,  then  the 
notation  for  computing  objective  function  values  J*[t,  1(1),  x(t-l)]  beginning  with 
week  49  is  as  follows: 

Week  49: 

J[A9,  7(49),  x(48)]  =  0. 

Week  48: 

/[48,  7(48),  x(47)]  =  max 

a:(48)  e  X[48,  7(48),  a:(47)] 


x(48) 


+  7[49,  7(49),  x(48)]  }, 


and  continuing  in  a  similar  fashion. 
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Week  t: 


J[t,  I(t),  x(t -1)]  =  max  I  — —  +  J[t +  1,  I(t +  1), 

I  x(t) 

x(l)  e  X[t,l{t),x{t-\)] 

for  t  = 

— ^  represents  the  one  stage  "reward"  for  week  t.  The  Week  t  performance  function 
x{t) 

denotes  the  general  fomi  for  computing  the  optimal  "reward-to-go"  from  the  current  stage 
t  to  last  stage  T.  I(t  +  1)  is  computed  from  the  state  transition  equation  for  each  element 
of  X[jc(r)],  given  the  state  [r,  /(r  +  1),  x(0]  and  recruit  arrivals  r(t)  and  r(r-l).  The 
optimal  company  strength  decision  x*(t)  is  determined  at  the  same  time  the  optimal 
objective  function  value  7* [?,*,*]  is  determined;  it  is  the  value  that  maximizes  the  right 
hand  side  of  each  equation  shown  in  steps  Week  49  to  Week  0.  If  there  is  a  tie  between 
two  or  more  values,  ties  are  broken  by  selecting  the  that  corresponds 

to  the  smallest  company  strength  decision  x* {t)  which  is  consistent  with  our  objective  of 
maximizing  training  "quality." 

The  DP  algoritlim  was  implemented  as  follows: 

STEP  0.  Initialize  the  Problem. 

Create:  one  optimal  decision  array  X*[x(r)]  for  each  period  t  = 

Create:  two  "reward-to-go"  arrays:  J*[?  + 1,  I{t  + 1),  x(0],  J*[f,  I{t),  x(t  - 1)]. 
Initialize:  J[49,  7(49),  x(48)]  =  [0]. 

Initialize:  the  1x49  array  of  recruit  arrivals;  RA[r(t)] . 

Initialize:  the  1x21  array  of  allowable  company  strengths;  X[x(0]- 
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STEP  1.  Implement  the  DP  Recursion. 


Begin:  in  Stage  48  and  work  BACKWARD  in  time  to  Stage  1, 
Do:  Tasks  (1.1)  tlirough  (1.8)  for  each  of  the  48  stages. 


1.  Do:  the  following  for  each  state  [t,  I{t  +  1),  x(f)]  of 
r[t,  m,  x(t-i)]. 

2.  Compute:  I(t  + 1)  for  each  element  of  X[x(r)]  given  [t,  I{t),  x(t  - 1)], 
r(t)  and  r(t  - 1)  according  to  the  following  two  cases  (see  Section  2.3 
for  details): 


A.  ForA;(*)<X:  I{t  +  l)  =  I(t)  + 


B.  For  ;c(=*')  =  X:  I(t  +  l)  =  1(1)  + 


r(r-l) 

r(t) 

_.x(r-l)_ 

_x(0. 

>(?-!)■ 

\r(t)] 

x(t  - 1) 

x(t) 

3.  Get:  the  value  from  J*[r  + 1,  I(t  + 1),  x(r)]  that  corresponds  to  state 
[t,  I(t  + 1),  j:(0],  using  the  rounded  value  of  I(t  + 1)  just  computed  in 
(1.2). 

4.  Compute:  one  stage  reward  — ^  for  each  element  of  X[x(0]- 


5.  Compute:  /[r,  I(t),  x(t  - 1)],  for  each  element  of  X[x(r)],  as  the  sum 
of(1.3)and(1.4). 


6.  Choose:  from  each  of  the  J\t,  I(t),  x(t- 1)]  values  computed  for 
X[x(0],  the  maximum  objective  function  value  J*[t,  I(t),  x(t  - 1)], 
and  record  the  corresponding  optimal  company  strength  decision 


no 


x*(t).  Ties  ai-e  broken  by  selecting  the  ,*]  that  corresponds  to 
the  smallest  company  strength  decision  x*(t). 

7.  Enter:  J*[t,  1(1),  x(t  - 1)]  and  x*  (t)  at  the  [t,  I(t),  x(t  - 1)]  position 
of  J*[r,  I(t),  x(t  - 1)]  and  X*[j:(r)]  arrays,  respectively.  Then, 

8.  Return:  to  (1.1). 


STEP  2.  Determine  Optimal  Company  Strength  Policy. 


Given:  an  initial  starting  point  of[l,  1(1),  x(0)]. 

Begin:  in  Stage  1  and  work  FORWARD  in  time  to  Stage  48. 
Do:  Tasks  (2.1)  tlirough  (2.5)  for  each  of  the  48  stages. 


1.  Look  Up:  the  corresponding  value  for  x*(t)  from  the  optimal  decision 
array  X*[jc(r)]  for  the  current  state  [t,  I(t),  x(t  - 1)]. 


2. 


Compute:  I(t  + 1)  =  I(t)  + 


r(t-l) 

x(t  - 1) 


r(t) 

x*(t) 


for  [t,  I(t),  x(f-l)],  x*(0,  r(t)  and  r(t-l). 


3.  Record:  the  value  of  x*(t)  for  week  f  in  71* 


*  * 
■^2  >•••’•^48 


}. 


4.  Replace:  the  current  values  of  [t,  I(t),  x(t- 1)]  by  I(t  + 1)  and  x* (t) 
just  determined.  Then, 

5.  Return:  to  (2.1). 


The  DP  algorithm  has  been  implemented  in  the  same  spreadsheet  environment 
used  for  the  SPH  and  MPH  procedures,  LOTUS  1-2-3  for  Windows  Release  4.  This 
enables  processing  times  for  the  two  methods  to  be  compared  using  the  same  type  of 
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computer  and  software  environment.  Computer  code  for  spreadsheet  implementation  of 
the  DP  algoritlun  is  given  in  Appendix  G,  along  with  numerical  examples  of  the 
J*[r,  7(0,  1)]  and  X*[x(r)]  amys. 

DP  Processing  Time 

The  processing  time  for  one  stage  using  the  dynamic  programming  algoritlun  took 
approximately  43  minutes,  bringing  the  total  rumiing  time  required  to  obtain  a  solution 
for  48  periods  to  approximately  34  hours  on  a  486/D66  desktop  eomputer.  However,  DP 
processing  time  can  be  substantially  improved  by  implementing  the  DP  algoritlun  on  a 
more  powerful  computer.  Even  so,  it  is  unlikely  that  the  DP  procedure  can  be 
successfully  used  for  quick-tum-around  analyses  required  to  support  "what-if '  scenarios 
that  often  arise  during  decision  analysis.  However,  the  DP  technique  has  a  major 
advantage  over  the  SPH-MPH  procedure  in  that  once  the  DP  results  are  obtained,  the 
"value"  for  every  feasible  company  strength  policy  (e.g.,  sequence  of  company  strength 
decisions)  is  known.  In  addition,  the  policies  themselves  are  very  easily  obtained  from 
the  DP  solution  as  a  simple  table  "look  up"  exercise.  Also,  the  system  could  "interpolate" 
for  scenarios  not  in  the  database. 

If  DP  results  for  a  sufficient  number  of  scheduling  scenarios  can  be  pre-computed, 
then  it  may  be  possible  to  incorporate  the  results  in  an  "expert  system"  to  support  training 
resource  scheduling  at  the  training  installation  level,  or  across  training  installations  as 
well.  An  expert  system  can  make  scheduling  results,  for  whatever  scenarios  are  in  the 
data  base,  "instantly"  available  to  system  users  on  virtually  any  386  or  486  desktop 
computer. 
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DP  Versus  Heuristic  Results 

Twelve  representative  examples  (of  the  441  DP  solutions  generated)  for  one- 
period  time  lag  problems  have  been  selected  to  compare  the  DP  and  heuristic  scheduling 
methods.  Table  7  gives  the  objective  function  values  obtained  using  DP  and  the  SPH- 
MPH  procedures,  and  the  effectiveness  of  the  heuristic  solutions  compared  to  the  DP 
solutions.  For  initial  conditions  I{t)  =  10  tlirough  I{t)  =  20,  the  differences  between  the 
DP  and  SPH-MPH  solutions  are  negligible  (see  Appendix  G). 

From  the  results,  we  notice  that  the  SPH-MPH  method  is  more  sensitive  to  initial 
conditions  than  the  DP  procedure.  This  is  indicated  by  the  variability  of  the  objective 
function  values  for  different  initial  company  strengths  (150,  200,  250),  for  a  given  initial 
idle  company  value.  Our  second  observation  is  that  the  objective  function  values  increase 
as  the  number  of  initial  idle  companies  increase  (as  expected).  Averaging  the  values 
suggests  that  the  SPH-MPH  method  (potentially)  determines  solutions  that  are  91% 
(approximately)  of  the  optimal  for  the  one-period  lag  problem. 


Test  Case 

lit) 

x{t  - 1) 

SPH-MPH 

DP 

%  Soln 

1 

0 

150 

.3047 

.3121 

98% 

2 

0 

200 

.2831 

.3103 

91  % 

3 

0 

250 

.2652 

.3027 

88% 

4 

1 

150 

.3071 

.3121 

98% 

5 

1 

200 

.2981 

.3113 

93% 

6 

1 

250 

.2721 

.3063 

89% 

7 

2 

150 

.2091 

.3121 

98% 

8 

2 

200 

.2942 

.3119 

94% 

9 

2 

250 

.2773 

.3084 

90% 

10 

3 

150 

.3108 

.3121 

99% 

11 

3 

200 

.2983 

.3121 

96% 

12 

3 

250 

.2831 

.3103 

91  % 

Table  7.  Scheduling  Results  for  the  One-Period  Lag  Problem 
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Figure  18  displays  the  "quality"  objective  function  values  by  solution  method  to 
allow  a  visual  comparison  of  results. 


Figure  18.  Comparison  of  Heuristic  versus  DP  Solutions  to  the  1 -Period  Lag  Problem 

Although  objective  function  values  for  both  methods  tend  to  decrease  with  increases  in 
company  strengths,  the  drop  in  performance  of  the  SPH-MPH  method  is  much  more 
dramatic  when  compared  to  results  from  the  DP  procedure  for  a  fixed  value  of  I{t) . 

Next,  the  DP  and  SPH-MPH  solution  methods  are  compared  on  the  basis  of 
company  strength  policies  (see  Figures  19  and  20)  and  idle  training  companies  (see 
Figures  21  and  22).  These  two  examples  are  representative  (for  the  twelve  cases 
considered)  of  "good"  and  "poor"  heuristic  solutions  based  on  solutions  that  are  98%  and 
88%  percent  effective,  respectively  (see  Table  7).  The  comparison  is  based  on  the  same 
initial  condition  for  idle  companies,  7(1)  =  0,  but  different  initial  company  strengths, 
X  (0)  =  150  and  x  (0)  =  250,  where  superscripts  1  and  2  denote  the  two  cases  we  are 
comparing.  The  optimal  objective  function  values  are  /^*[1,  1,  150]  =  0.3121  and 
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/^*[1,  1,  250]  =  0.3027,  and  the  corresponding  heuristic  objective  function  values  are 
0.3047  and  0.2652,  respectively. 

Figure  19  compares  DP  and  SPH-MPH  company  strength  decisions  for 
v'(0)  =  150,  and  Figure  20  makes  the  same  comparison  for  v^(0)  =  250.  From  Figures 
19  and  20,  the  impact  of  the  initial  company  strength  values  of  150  versus  250  is  much 
more  evident;  especially  for  the  heuristic  method. 

Conclusions 

From  the  tests  and  test  conditions  described  for  the  real-world  problem  scenarios 
of  Section  6.1,  we  estimate  that  the  SPFl-MPH  can  determine  solutions  that  are  87% 
(approximately)  of  the  utopian  bound  for  the  objective  funetion.  The  faet  that  these 
results  were  obtained  in  reasonable  time  on  a  486  microcomputer  establishes  the  potential 
value  of  the  heuristic  methods  to  basic  training  management  and  related  decision 
processes. 

For  the  one-period  time  lag  problem,  the  heuristic  methods  achieved  results  that 
were  91%  of  optimal  for  a  small  but  (seemingly)  representative  set  of  test  cases.  Based 
on  comparing  results  of  different  initial  company  strengths  for  the  one-period  time  lag 
problem,  it  is  expected  that  the  performance  of  the  heuristic  procedure  will  likely  decline 
relative  to  DP  as  problem  size  increases. 

Finally,  although  generating  the  DP  solution  is  computationally  intensive,  the  DP 
results  are  very  valuable.  Not  only  because  they  are  of  superior  quality  to  the  heuristic 
results,  but  also  because  the  DP  algorithm  generates  (1)  the  optimal  objective  function 
value,  and  (2)  the  decisions  to  take  in  each  period,  for  every  initial  state.  Once  obtained, 
albeit  at  a  (potentially)  very  high  cost,  the  results  are  very  easily  obtainable  via  a  simple 
table  "look  up”  procedure  but  impractical  for  "what-if  analysis. 
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7.  CONCLUSIONS 

This  dissertation  solves  a  complex  scheduling  problem  of  practical  interest  to  the 
United  States  Army  that  has  apparently  not  been  previously  reported  in  the  literature. 
Specifically,  the  problem  is  one  of  scheduling  reusable  training  resources  for  the  Army's 
Basic  Combat  Training  program  over  a  finite  planning  horizon  of  T  stages. 

7. 1  Summary  and  Contributions  of  Work 

One  contribution  of  this  work  to  military  systems  engineering  is  the  mathematical 
formulation  of  the  optimal  resource  scheduling  problem  that  accounts  for  important 
dynamics  of  the  basic  training  process.  Notable  dynamic  characteristics  of  the  training 
system  model  include: 

1 .  seasonal  arrival  of  recruits  that  represents  demand  for  training 
resources; 

2.  varying  training  company  strength;  and 

3.  varying  training  cycle  length. 

In  the  real-world  basic  training  problem,  recruit  arrival  is  a  random  process  which  makes 
the  demand  for  training  resources  stochastic.  However,  we  estimate  recruit  arrivals  ahead 
of  time  for  each  of  T  time  periods  which  makes  our  problem  deterministic. 

The  primary  optimization  (or  suboptimization,  as  the  case  may  be)  objective  of 
the  basic  training  problem  is  based  on  a  key  measure  of  training  system  performance; 
namely,  the  "quality"  of  the  training  program  as  measured  by  the  instructor-to-student 
ratio.  A  second  objective  (given)  is  the  minimization  of  training  program  cost;  also  a 
very  desirable  criterion.  Based  on  the  primary  objective,  two  decision  elements  are 
modeled  that  reflect  realistic  decisions  made  in  practice:  training  company  strength  and 
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training  cycle  length.  The  problem  is  to  detennine  the  appropriate  strength  and  cycle 
length  for  training  companies  at  the  begiiming  of  each  of  the  T  time  periods  to  meet 
deterministic  demand  for  training  resources  as  measured  by  recruit  arrivals. 

A  second  major  contribution  of  this  work  is  the  implementation  of  scheduling 
methods  for  solving  the  training  resource  problem.  Two  decision  models  are  considered. 
The  first  is  based  on  dynamic  programming',  an  exact  optimization  where  the  scheduling 
decisions  at  each  stage  of  the  planning  horizon  are  the  ones  that  maximize  the  sum  of  the 
training  quality  measure  for  the  current  stage  plus  the  best  value  for  the  objective  function 
(total  training  quality)  that  can  be  achieved  for  future  stages.  Computer  implementation 
of  the  dynamic  programming  teclmique  has  been  limited  to  a  simplified  version  of  the 
real-world  basic  training  problem  due  to  the  massive  size  of  the  original  problem. 

The  second  decision  model  consists  of  two  heuristics  applied  in  three  phases  to 
obtain  a  suboptimal  resource  scheduling  solution.  The  first  of  the  two  heuristics  is  a 
forward  single-pass  heuristic  (SPH)  that  generates  a  "good"  initial  feasible  training 
resource  schedule.  The  second  heuristic  is  a  backward  multi-pass  heuristic  (MPH)  that 
sequentially  (in  time)  and  iteratively  (in  decisions)  refines  the  initial  feasible  schedule 
until  no  further  improvements  can  be  made.  Based  on  experimental  results,  the  single- 
and  multi-pass  heuristics  generate  solutions  that  are  approximately  87%  effective  (based 
on  the  utopian  value  of  the  training  quality  performance  measure).  This  is  a  significant 
improvement  over  the  approximately  74%  effective  solutions  achieved  with  heuristics- 
used-in-practice  for  the  same  set  of  test  problems.  The  SPH  and  MPH  include  a  fully 
automated  heuristic  policy  iteration  step  that  is  motivated  by  the  policy  iteration  approach 
to  solving  dynamic  programs. 

A  third  major  contribution  of  this  dissertation  is  the  development  of  an 
operational  Decision  Support  System  for  Army  Basic  Combat  Training  Resource 
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Management  per  Year,  ox  ARMY,  that  fully  implements  the  SPH  and  MPH  heuristics  for 
solving  real-world  basic  training  resource  scheduling  problems.  The  decision  support 
system  has  been  designed  to  support  three  major  levels  of  decision  making:  strategic 
plaiming  at  the  Department  of  the  Army  level,  training  installation  management  at 
TRADOC  Headquarters,  and  operational  control  of  the  training  program  at  the  training 
installation.  Experiments  with  ARKdY\x20J0  demonstrated  that  the  system  can  be  used  to 
estimate 

•  minimal  training  structure  required  to  meet  current  and  future  amrual  training 
requirements; 

•  training  resource  utilization;  and 

•  basic  training  program  costs  including  the  overall  annual  cost  of  the  basic 
training  program,  the  annual  costs  of  specific  program  resources,  and  the 
training  program  cost  per  trainee. 

ARMY  also  supports  the  analyses  of  a  variety  of  training  installation  and  training 
program  management  issues,  such  as, 

•  examining  the  impact  of  consolidating,  closing,  or  reducing  the  size  of 
training  installations  on  the  Army's  ability  to  meet  future  training  missions; 

•  evaluating  the  economic  impact  of  different  resource  utilization  policies; 

•  evaluating  training  readiness  (as  a  function  of  training  capacity  and  training 
program  throughput)  by  varying  training  parameters,  such  as,  recruiting  levels, 
training  program  durations  (course  lengths),  or  levels  of  available  training 
resources;  and 
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•  forecasting  training  resource  requirements  for  the  basic  training  program  and 
estimating  basic  training  program  costs. 

Potential  benefits  of  the  ARMY  system  to  the  to  the  Army,  and  perhaps  to  the 
Department  of  Defense,  in  general,  include: 

•  improved  forecasting  of  training  resource  requirements; 

•  improved  training  resource  scheduling; 

•  improved  resource  utilization  through  tighter  control  of  facilities,  equipment, 
supplies,  and  manpower;  plus 

•  considerable  cost  savings  associated  with  each  of  the  improvements  to 
training  management  cited  above. 

7.2  Suggestions  for  Future  Research 

This  work  has  concentrated  mainly  on  the  formulation  of  a  model  for  the  Army's 
Basic  Combat  Training  program,  and  implementing  a  decision  model  for  scheduling  one 
major  training  resource;  basic  training  companies.  The  dissertation  concludes  with  a  list 
of  potential  research  areas  to  be  studied  in  the  future. 

•  In  the  model,  permit  recruits  to  be  backlogged  in  the  model  when  recruit 
arrivals  exceed  training  capacity.  This  means  that  demand  for  training 
resources  (i.e.,  training  companies)  can  be  backlogged,  and  therefore, 

Ij  (t)  <  0  is  permitted  in  the  model. 

•  Make  the  basic  training  model  more  realistic  by  incorporating  (1)  recruit 
failures,  (2)  recycling  of  recruits  who  fail  basic  training  and  (3)  account  for  the 
added  cost  of  recycling. 
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•  Incorporate  additional  direct  and  indirect  costs  into  the  Resource  Costing 

Module  of  the  system. 

•  Investigate  the  effect  of  incorporating  stochastic  processes  for  recruit  arrivals, 
recruit  failures,  and  recruit  recycling  on  demand  for  training  resources  and  on 
resource  scheduling. 

•  Extend  the  model  and  the  decision  support  system  to  include  other  Army 
training  programs  (AIT,  OSUT,  training  programs  for  commissioned  and 
noncommissioned  officers,  and  specialty  training  programs,  such  as.  Airborne 
School,  Ranger  School,  etc.),  and  training  programs  of  other  branches  of 
military  service  (Air  Force,  Navy,  and  Marines). 

•  Improve  the  efficiency  of  the  computer  code  for  the  single-  and  multi-pass 
heuristic  scheduling  algorithms. 

•  Extend  the  optimal  dynamic  programming  implementation  to  larger,  more 
realistic  scheduling  problems. 

In  conclusion,  the  model  of  the  basic  training  program  and  the  scheduling 
methodologies  presented  in  this  dissertation  will  hopefully  motivate  further  research 
efforts  in  this  important  area  of  military  operations  research. 
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Appendix  A: 

SUMMARY  OF  LITERATURE  SURVEY 

An  extensive  search  of  the  literature  was  made  for  papers  related  to  military 
training  programs,  training  resource  scheduling,  training  base  or  training  installation 
management,  military  base  closures  or  base  realignment,  and  other  relevant  issues.  The 
primary  sources  for  the  literature  search  were  the  Compendex  and  MathSci  databases,  and 
the  Defense  Technical  Information  Center  (DTIC).  Compendex  and  MathSci  are 
international  databases  referencing  over  4500  and  3200  journals,  conference  proceedings, 
and  technical  reports,  respectively,  in  such  areas  as  engineering,  mathematics,  operations 
research,  and  computing.  DTIC  is  a  central  depository  of  scientific  and  technical 
research,  and  data  collection,  for  the  Department  of  Defense.  The  MathSci  and  DTIC 
databases  were  searched  from  1980  through  1993  and  Compendex  from  1986  through 
1993  using  approximately  100  key  words  that  generated  more  than  28,000  citations  (see 
Table  8  below).  Initial  screening,  done  by  title,  identified  approximately  200  papers 
thought  to  have  some  connection  (however  remote)  to  the  basic  training  problem.  These 
200  were  further  screened  by  reading  the  abstracts.  This  step  reduced  the  number  of 
papers  to  approximately  eighty,  of  which  fifty  (or  so)  were  related  to  the  economic  lot¬ 
sizing  problem.  Not  one  of  the  remaining  thirty  papers  was  directly  related  to  the  basic 
training  problem  presented  here. 

Table  8  summarizes  the  literature  search.  The  value  in  each  cell  gives  the  number 
of  references  initially  obtained  by  keyword  and  year.  Totals  by  key  word  are  given  in  the 
right  most  column  and  the  last  row  totals  references  by  year. 
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Key  WordsAfears 


Air  Force 


Army  _ 


Army  Bases 


Army  Battalion  Training 


Army  Facilities 


Anny  Installations 


Anny  Recruitment 


Army  Reserve  Pers  Trainin 


Army  Training  Management 


Decision  Support  System 


Decision  Theory-Mil  Appl 


Decision  Theory-Mil  Purp 


DP  _ 


DP  Algorithm 


DSS 


DSS  Design 


DSS  Military  Application 


DSS  Military  Purposes 


Manpower 


Manpower,  Personnel,  Tng 


Manpower  Allocation _ 


Manpower  Analysis 


Manpower  Assessment _ 


Manpower  Costs  _ 


Manpower  Forecasting 


Manpower  Issues 


Manpower  Management 


Manpower  Modeling 


Manpower  Opt  _ 


Manpower  Planning 


Manpower  Plaiming  Model 


Manpower  Policies 


Manpower  Resources 


Manpower  Resources  Mgt 


Manpower  Scheduling _ 


Manpower  System  Costs 


Manpower  Trainmg _ 


Military  Engr-Math  Models 


Military  Engr-Personnel 


Military  Engr-Personnel  Tng 
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1 

1 
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6 
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98  105 
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Military  Applications 


Military  Base  Closures 


Military  Bases 


Military  Installations 


Military  Personnel 


Military  Training 


Navy  _ 


Recruit 


Recruiting 


Recruitment 


Resource  Allocation 


Resource  Allocation  Algo 


Resource  Allocation  Pblm 


Resource  Assignment  Pblm 


Resource  Balancing  Pblm 


Resource  Management 


Resource  Management  Model 


Resource  Modeling 


Resource  Plannmg 


Resource  Scheduling 


Resource  Scheduling  Costs 


Scheduling 


Scheduling  Algorithms 


Scheduling  Analysis 


Scheduling  Applications 


Scheduling  Automation 


Scheduling  Calculations 


Scheduling  Computer  Applic 


Scheduling  Computer  Simul 


Scheduling  Costs 


Scheduling  Heuristics 


Scheduling  Math  Models 


Scheduling  Management 


Scheduling  Problems 


Scheduling  Personnel 


Scheduling  Programs 


Scheduling-Military  Purposes 


Training 


Training  Activities  Schedule 


Training  Costs 


Training  Facilities 


Training  Installations 


246  1  249 


1 


179  I  291  I  328  I  301  I  306 


160  I  172 


983  1193 


19 


422  1  1657  1  1526 


136  I  138 


186  I  258 


30  3 


56  I  12  I  23  1  24 


1 


1 


1  6  7  3 


938  I  816  I  1059  I  1174  I  1030 
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3 


33 

35 

11 

3 

1 

1 

43  I  75 
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1 


1 
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Training  Management 

1 

1 

1 

3 

Training  Missions 

1 

1 

Training  Process 

1 

1 

Training  Program 

5 

6 

5 

4 

18 

38 

Training  Resources 

1 

1 

2 

Training  Scheduling  Algo 

3 

3 

Training  Strategies 

1 

1 

1 

3 

Training  Systems 

3 

1 

1 

5 

Training  System  Analysis 

1 

1 

Training  System  Design 

1 

1 

US  Air  Force 

2 

27 

19 

13 

5 

2 

2 

11 

81 

US  Air  Force  Pers 

1 

1 

US  Army 

2 

14 

5 

2 

5 

1 

12 

41 

US  Army  Installations 

1 

1 

US  Navy 

7 

12 

15 

4 

11 

1 

5 

3 

58 

TOTAL 

1283 

4317 

4323 

3215 

2969 

3945 

4154 

4201 

28406 

Table  8.  Literature  Search  Results 
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Figure  23.  Training  Base  Scenario  Decision  Process 
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Appendix  C: 

ILLUSTRATIVE  SESSION  WITH  THE  DECISION  SUPPORT  SYSTEM 

Training  Base  Scenario  11  is  used  to  illustrate  numerical  and  graphical  output 
from  the  ARMY  decision  support  system  (DSS).  Scheduling  results  are  based  on  1988 
training  installation  structure  and  (notional)  projected  recruiting  objectives  for  1989  and 
1990.  Initially,  there  are  130  training  companies  available  at  the  beginning  of  1989. 
Historical  data  is  used  to  "warm-up"  the  model  so  that  idle  companies  in  week  1  of  1989 
reflects  a  realistic  initial  starting  point.  The  training  company  balance  equation  given  by 
equation  (12)  of  Section  2.3  determines  the  number  of  idle  training  companies  in  each 
week.  Company  Strengthis)  and  training  Cycle  Length{s)  (Figure  24  below)  are 
initialized  at  their  lower  and  upper  bounds  of  150  recruits  and  10  weeks,  respectively. 
Training  company  deactivation  is  applied  at  the  beginning  of  each  year  of  the  planning 
horizon  (according  to  Scenario  1 1  (fifteen  companies  are  deactivated  at  the  beginning  of 
both  1989  and  1990) .  This  is  reflected  in  the  number  of  training  companies  available  at 
the  beginning  of  1989  and  1990,  115  and  100  companies,  respectively  (see  BCT  Co, 
Figure  24).  The  Less  Deactivated  Co  row  is  used  when  training  company  deactivation  is 
applied  across  the  planning  horizon.  The  Co  from  [10,  9,  8]  Wk  Cycle  rows  serve  as  the 
work  space  for  the  Resource  Scheduling  Algorithm.  Some  summary  statistics  are  given 
at  the  end  of  the  planning  horizon  in  Figure  24.  Figures  25  through  29  graph  idle  training 
companies  at  progressive  stages  of  the  heuristic  scheduling  procedure. 
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Training  Company  Shortfalls  Before  Beginning  Heuristic  Scheduling 
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Figijre  28.  Training  Company  Shortfalls  After  Deactivating  Training  Companies  in  Phase  II 


141 


Appendix  D: 

INDIRECT  TRAINING  PROGRAM  COSTS  1 


BASE  OPERATIONS  SUPPORT 

RASE  SUPPORT  SERVICES 
Administration 
Retail  Supply  Operations 
Maintenance  of  Installation  Equipment 
Other  Persormel  Support 

FACILITY  SUPPORT  SERVICES 
Utilities 

Maintenance  and  Repair  of  Real  Property 
Minor  Construction 


BASE  SUPPORT  SERVICES 

ADMINISTRATION 
ADP  Facilities  and  Equipment 
ADP  Services 

Analysis  &  Resources  Management 
Base  Audiovisual  Activities 
Base  Telephone  &  Telecommunications 
Chaplain  Activities 
Command  Element 

Comptroller  and  Accounting  &  Finance 
Correspondence  Control 
Dependent  Schools 
Equal  Employment  Opportunity 

RFTAIT,  SUPPLY  OPERATIONS 
Clothing  Issue  Points 
Fuel  Management 

Military  Clothing  &  Equipment  Sales 
Purchasing  &  Contracting 
Military  Persormel  Management 
Other  Dining  Costs 
Other  Persormel  Costs 


Other  Base  Services 
Morale,  Welfare  and  Recreation 
Unaccompanied  Housing 


Other  Engineer  Support 
Environmental  Compliance 


Headquarters  Administration  &  Command 
Inspector  General 

Internal  Review  &  Audit  Compliance 

Legal  Services 

Local  Automation 

Manpower  Management 

Printing  &  Reproduction 

Program  &  Budget  Activities 

Public  Affairs 

Records  Management 

Safety 

Persormel  &  Commrmity  Affairs 
Ration  Distribution 
Reenlistment  Activities 
Social  Activities 
Training 

Treatment  Programs 


ICourtesy  of  the  Directorate  of  Resource  Management  (DRM),  Fort  Huachuca,  Arizona. 


OTHER  BASK  OPERATIONS 
Base  Transportation 
Vehicle  Operations  &  Maintenance 
Civil  Disturbance  Activities 
Con'ections  &  Confinement 
Counterintelligence  Operations 
Disaster  Preparedness 
Household  Goods 
Installation  Traffic  Management 
Laundry  &  Dry  Cleaning 
Leased  Physical  Security  Vehicles 
Liaison  &  Apprehension 
Military  Police  Activities 
Museums 

Nuclear  &  Chemical  Activities 

MORATH.  WEJ.FARE  &  RECREATION 
Auto,  Craft  &  Hobby  Shops 
Bowling  Alley 

Child  Care  Centers  &  Activities 
Clubs,  Messes  &  Restaurants 
Theaters,  Marinas,  Golf  Courses  &  Pools 
Other  Non-appropriated  Resale  Activities 
Community  Support  Activities 
Entertainment  Tickets  &  Tours 
Other  Special  Services 
Family  Member  Employment  Program 
Family  Support  Programs 

UNACCOMPANIED  HOUSING 
Moving  Unaccompanied  Personnel 
Procurement,  Control,  Issue,  Repair  & 
Replacement  of  Unaccompanied  Housing 
Furnishings 


Other  Base  Support 
Physical  Security  Activities 
Plans,  Training  &  Mobilization  Support 
Rail  Services 

Reserve  Component  Support 

Security  Administration  Support 

Security  Guards  &  Police 

Security  Operations 

Small  Security  Equipment  Purchases 

Small  Security  Equipment  Installation 

Traffic  Control 

Training  Devices 

Training  Facilities 

Water  Port  Services 

Financial  Planning-Consumer  Affairs 

Foster  Care 

Libraries 

Recreation  Activities  &  Centers 

Relocation  Services 

Selected  Medical  &  Dental  Clinics 

Social  Activities  &  Support  Groups 

Sports  Programs 

Welfare  Funds 

Youth  Activities 

Youth  Development  Activities 

Unaccompanied  Housing  Activities 
Operation  of  Unaccompanied  Housing 
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FACILITY  SUPPORT  SERVICES 
TITTTJTTES 

Air  Conditioning  &  Refrigeration  Plants 
Electric  Service  &  Systems 
Other  Utilities 
Sewer  Service  &  Systems 

MAINTENANCE  &  REPAIRS 
OF  REAT.  PROPERTY  IMRPl 
Buildings 

Facilities  Engineering  Shops 
Grounds 

Miscellaneous  Maintenance 
MRP  of  Inactive  Installations 

MAINTENANCE  OF  INSTALLATION 

EQUIPMENT 

Audiovisual  Equipment 

Chaplain  Equipment 

Band  Equipment 

Electronics  Equipment 

Engineering  Equipment 

Communications  Equipment 

ADP  Equipment 

Food  Service  Equipment 

Training  Equipment 

OTHER  ENGINEERING  SUPPORT 
Custodial  Services 

Engr.  Public  Works  and  Management 
Engineering  Support  (Inactive  Install) 
Equipment 

MINOR  CONSTRUCTION 
Alterations  &  Construction  (Active  Install) 

OTHER  PERSONNEL  SUPPORT 
Bands 

Civilian  Personnel  Management 
Contractor  Food  Operations 
Dining  Facilities 


Steam  &  Hot  Water  Heating  Plants 
Utilities  at  Inactive  Installations 
Water  Service  Systems 


Railroads 
Surfaced  Areas 
Tool  Issue 
Utility  Systems 

Suspense  Accounts  (Engr  Shops  &  Tools) 


General  Equipment 

Non-Tactical  Equipment  c&  Vehicles 

Other  Commodities 

Other  Support  Equipment 

Personnel  Support  Equipment 

Rail  Equipment 

Airlift  Support  Equipment 

Unaccompanied  Personnel  Furnishings 


Self-Service  Centers 
Stock  &  Supply  Operations 
Stock  Distribution 
Storage  &  Warehousing 

Alterations  &  Construction  (Inactive  Install) 

Education 
Food  Service 
Information  Activities 
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Appendix  E: 

COST FACTORS 

Resource  cost  data  from  the  1993  Fort  Benning  Study,  discussed  in  Chapter  5,  is 
used  to  compute  fixed  and  variable  training  resource  cost  factors.  Results  are  given 
below.  Chapter  6  illustrates  how  the  cost  factors  may  be  applied  to  estimate  training 
resource  costs  for  training  resource  schedules  obtained  via  the  heuristic  scheduling 
approaches  of  Chapter  4. 

Fixed  costs  are  generally  assessed  at  the  beginning  of  each  year  (YR)  of  the 
planning  horizon  based  on  the  number  of  training  battalions,  or  the  number  of  training 
companies,  available  at  the  beginning  of  each  year.  Fixed  and  variable  costs  are 
computed  by  training  battalion  (BN)  and  training  company  (CO).  Variable  costs  are  per 
recruit  are  also  computed.  Costs  per  start  for  battalions  and  training  companies  reflects 
the  cost  per  training  cycle  for  those  units,  where  a  battalion  cost  is  assessed  for  each  set  of 
five  training  companies  that  start  training  in  a  given  week  (see  Section  6.1  for  further 
details).  Cost  factors  are  summarized  for  each  cost  item  at  the  end  of  Appendix  E. 


PERSONNEL  IN-PROCESSING 


Cost  Factors  and  Computations 

$/bn/yr 

$^n/start 

$/co/yr 

$/co/start 

Civilian  employee  for  soldier  in-processing: 

$27,000 

General  Supplies: 

$3,000/C0. 

Assuming  200  recruits  per  company, 

$3,000/C0  ,  • 

- - - =  $15/ Recruit 

200  Recruits/CO 

$15 

Organizational  Clothing: 

$14,000/C0. 

Assuming  200  recruits  per  company, 

$14,000/C0 

- ^ - =  $70/Recruit 

200  Recruits/CO 

$70 

Subtotal 


$27,000 


$85 


BASIC  TRAINING  SUPPORT 
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Medical  Supplies  Issue; 

$5,000/C0. 

Assuming  6  starts  per  year, 

$5.000/CO/YR  ,  , 

- - - - - =  $833/CO/Start 

6  Starts/ YR 

$833 

Subtotal 

$83,000 

$2166 

$85 

SUPPLY  OPERATIONS 


Cost  Factors  and  Computations 

$^n/yr 

$/bn/start 

$/co/yr 

$/co/start 

$/recruit 

Sewing  Machine  Operator: 

$19,000 

Tailor: 

$20,000 

Clerk  to  transcribe  data  and  process  records: 

$19,000 

Subtotal 

$61,000 

MAINTENANCE  OF  MATERIALS:  Direct  Support  (DS)  &  General  Support  (GS) 


Cost  Factors  and  Computations 

$/bn/yr 

$/bn/start 

$/co/yr 

$/co/start 

$/recruit 

Worker  for  Organizational  Maintenance: 

$17,000 

Workers  for  DS/GS  Maintenance: 

$51,000 

Weapons  Maintenance: 

$14,000/00. 

Assuming  200  recruits  per  company, 

$14,000/C0 

- =  $70/Recruit 

200  Recruits/CO 

$70 

48 


Maintenance  of  Load-Bearing  Equipment: 

$2,000/C0. 

Assuming  200  recruits  per  company, 

$2,000/C0 

- =  $10/ Recruit 

200  Recruits/CO 

$10 

DS/GS  Repair  Parts: 

$42,000/C0. 

Assuming  200  recruits  per  company, 

$42,000/C0 

- - - =  $2 10/ Recruit 

200  Recruits/CO 

Subtotal 

$68,000 

TRANSPORTATION  SERVICES 


Cost  Factors  and  Computations 

$^n/yr 

$^n/start 

$/co/yr 

$/co/start 

$/recruit 

Training  Company  Driver  Costs: 

$56,000 

Administrative  Vehicle  Costs: 

$  1 5,000/BN  =  $  1 5,000/5  CO. 

Assuming  6  starts  per  year, 

$15,000/BN/YR  ,  , 

- - - - - =  $2500/BN/Start 

6  Starts/ YR 

$2500 

Training  Company  Vehicle  Costs: 

$11,000/C0. 

Assuming  6  starts  per  year, 

$ii,ooo/co/yr  ,  , 

- - - - - =  $1833/CO/Start 

6  Starts/YR 

$1833 

Subtotal 


$2500 


$56,000 


$1833 
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LAUNDRY  SERVICES 


Cost  Factor  and  Computations 

$^n/yr 

$/bn/start 

$/co/yr 

S/co/start 

$/recruit 

Laundry  Services: 

$145 

$29.000/C0. 

Assuming  200  recruits  per  company, 

$29.000/C0 

- =  $145/ Recruit 

200  Recruits/ CO 

Subtotal 

$145 

FOOD  SERVICES 


Cost  Factor  and  Computations 

$^n/yr 

$/bn/start 

$/co/yr 

S/co/start 

$/recruit 

Food  Services: 

$933.000/C0, 

Assuming  6  starts  per  year  and  200  recruits  per 

company, 

$933,000/CO/YR 

$778 

.  ..  .“4*//v3y  jivvCruii 

(6  Starts/YRj  (200  Recruits/CO) 

Subtotal 

$778 

PERSONNEL  SUPPORT 


Cost  Factor  and  Computations 

$/bn/yr 

$/bn/start 

$/co/yr 

$/co/start 

$/recruit 

Clerk  for  Personnel  Support: 

$26,000 

Subtotal 

$26,000 
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AMMUNITION 


Cost  Factor  and  Computations 

$^n/yr 

$/bn/start 

$/co/yr 

$/co/start 

$/recruit 

Ammunition  Issued  to  Recruits  for  Oualification  on 

Individual  Weapon  (Ml  6): 

ball:  375  rounds/recruit  at  $0.27/round 

tracer:  10  rounds/recruit  at  $0.27/round 

total:  $  104/recruit 

$104 

Ammunition  Issued  to  Recruits  for  Qualification  on  Sauad 

Assault  Weapon  (SAW): 

linked:  287  rounds/recruit  at  $0.47/round 

tracer:  317  rounds/recruit  at  $0.47/round 

total:  $284/recruit 

$284 

Ammunition  Issued  to  Trainina  Comoanv  Cadre  for 

Instruction  on  Individual  Weapon  (Ml 6): 

ball:  264  rounds/recruit  at  $0.27/round 

tracer:  30  rounds/recruit  at  $0.27/round 

total:  $51/CO/Start 

$80 

Ammunition  Issued  to  Trainina  Comnanv  Cadre  for 

Instruction  on  Squad  Assault  Weapon  (SAW): 

linked:  0  rounds/recruit  at  $0.47/round 

tracer:  108  rounds/recruit  at  $0.47/round 

total:  $5I/CO/Start 

$51 

Subtotal 

$131 

$388 
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UTILITIES 


Cost  Factor  and  Computations 

$^n/yr 

$/bn/start 

$/co/yr 

$/co/start 

$/recruit 

Annual  utilities  costs  of  $309,000  include: 

electricity  ($156,000); 

natural  gas  ($132,000); 

water  service  ($1 1,000); 

sewage  service  ($1 1,000) 

Assuming  6  starts  per  year, 

$309,000/BN/YR  ,  , 

- - - - - =  $51,500/BN/Start 

6  Starts/ YR 

$51,000 

Subtotal 

$51,500 

SUMMARY 


Cost  Factors 

$/bn/yr 

$/bn/start 

S/co/yr 

S/co/start 

$/recruit 

Personnel  In-Processing: 

$27,000 

$85 

Basic  Training  Support: 

$83,000 

$2166 

$345 

Supply  Operations: 

$61,000 

Maintenance  of  Materials: 

$68,000 

$290 

Transportation  Services: 

$2500 

$56,000 

$1833 

Laundry  Services: 

$145 

Food  Services: 

$788 

Personnel  Support: 

$26,000 

Ammunition: 

$131 

$388 

Utilities: 

$55,500 

GRAND  TOTAL: 

$110,000 

$58,000 

$211,000 

$4130 

$2041 
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Appendix  F : 

ADDITIONAL  TRAINING  PROGRAM  COST  ESTIMATES 

Cost  estimation  is  an  important  element  of  long-range  planning  and  important  to 
the  allocation  of  training  resources  to  support  basic  combat  training.  The  cost  module  of 
the  ARMY  system  can  support  these  processes,  as  well  as,  provide  cost  estimates  of 
current  training  program  operations  based  on  realistic  training  resource  utilization. 
Examples  of  other  types  of  costs  that  may  be  estimated  using  the  ARMY  system  are  given 
below  in  Tables  9,  10,  and  11.  The  cost  estimates  are  based  on  training  resource 
scheduling  results  from  Scenario  11. 

Tables  9  and  10  summarize  annual  costs  for  1989  and  1990,  respectively.  Table 
1 1  features  additional  cost  estimates  and  compares  the  results  from  the  two  years  of  the 
planning  horizon  by  annual  change  and  the  percent  change. 
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1989  Training  Costs 

$/bn/yr 

$/bn/start 

$/co/yr 

$/co/start 

$/recruit 

TOTAL 

Personnei  in-Processing 

$621,000 

$10,403,823 

$11,024,823 

Basic  Training  Support 

$1,909,000 

$1,416,564 

$42,227,282 

$45,552,846 

Supply  Operations 

$7,015,000 

$7,015,000 

Maintenance  of  Materials 

$7,820,000 

$35,495,397 

$43,315,397 

Transportation 

$327,000 

$6,440,000 

$1,198,782 

$7,965,782 

Laundry 

$17,747,698 

$17,747,698 

Food  Service 

$96,449,560 

$96,449,560 

Personnel  Support 

$2,990,000 

$2,990,000 

Ammunition 

$85,674 

$47,490,393 

$47,576,067 

Utilities 

$7,259,400 

$7,259,400 

TOTAL 

$2,530,000 

$7,586,400 

$24,265,000 

$2,701,020 

$249,814,153 

$286,896,573 

Table  9.  Summary  of  1989  Costs  for  Scenario  1 1 


1990  Training  Costs 

$/bn/yr 

$/bn/start 

$/co/yr 

$/co/start 

$/recruit 

TOTAL 

Personnel  In-Processing 

$540,000 

$9,791,834 

$10,331,834 

Basic  Training  Support 

$1,660,000 

$1,290,936 

$39,743,324 

$42,694,260 

Supply  Operations 

$6,100,000 

$6,100,000 

Maintenance  of  Materials 

$6,800,000 

$33,407,432 

$40,207,432 

Transportation 

$298,000 

$5,600,000 

$1,092,468 

$6,990,468 

Laundry 

$16,703,716 

$16,703,716 

Food  Service 

$90,776,057 

$90,776,057 

Personnel  Support 

$2,600,000 

$2,600,000 

Ammunition 

$78,076 

$44,696,840 

$44,774,916 

Utilities 

$6,615,600 

$6,615,600 

TOTAL 

$2,200,000 

$6,913,600 

$21,100,000 

$2,461,480 

$235,119,203 

$267,794,283 

Table  10.  Summary  of  1990  Costs  for  Scenario  1 1 


154 


Cost  Summary 

1989 

1990 

Difference 

%  Change 

Total  Cost 

$286,896,573 

$267,794,283 

($19,102,290) 

-6.66% 

Total  Fixed  Cost  (BN,  CO) 

$26,795,000 

$23,300,000 

($3,495,000) 

-13.04% 

Total  Variable  Cost  (BN,  CO,  Recruit) 

$260,101,573 

$244,494,283 

($15,607,290) 

-6.00% 

Total  Cost  by  Resource 

Personnel  In-Processing 

$11,024,823 

$10,331,834 

($692,990) 

-6.29% 

Basic  Training  Support 

$45,552,846 

$42,694,260 

($2,858,586) 

-6.28% 

Suppiy  Operations 

$7,015,000 

$6,100,000 

($915,000) 

-13.04% 

Maintenance  of  Materials 

$43,315,397 

$40,207,432 

($3,107,965) 

-7.18% 

Transportation 

$7,965,782 

$6,990,468 

($975,314) 

-12.24% 

Laundry 

$17,747,698 

$16,703,716 

($1,043,982) 

-5.88% 

Food  Service 

$96,449,560 

$90,776,057 

($5,673,504) 

-5.88% 

Personnel  Support 

$2,990,000 

$2,600,000 

($390,000) 

-13.04% 

Ammunition 

$47,576,067 

$44,774,916 

($2,801,151) 

-5.89% 

Utilities 

$7,259,400 

$6,615,600 

($643,800) 

-8.87% 

Average  Variable  Cost  Per  Training  Cycle 

$384,745 

$397,135 

$12,390 

3.22% 

BN  Variable  Cost  per  Training  Cycle 

$2,040 

$1,946 

($93) 

-4.58% 

CO  Variable  Cost  per  Training  Cycle 

$726 

$693 

($33) 

-4.58% 

RECRUIT  Variable  Cost  per  Training  Cycle 

$381,979 

$394,495 

$12,516 

3.28% 

Avg.  Variable  Cost  per  CO  per  Cycle  by  Resource 

Basic  Training  Support 

$381 

$363 

($17) 

-4.58% 

Transportation 

$322 

$308 

($15) 

-4.58% 

Ammunition 

$23 

$22 

($1) 

-4.58% 

Average  Program  Cost  Per  Recruit 

$2,344 

$2,325 

($19) 

-0.82% 

Average  Cost  per  Recruit  by  Resource 

Personnel  In-Processing 

$90 

$90 

($0) 

-0.43% 

Basic  Training  Support 

$372 

$371 

($2) 

-0.42% 

Supply  Operations 

$57 

$53 

($4) 

-7.61% 

Maintenance  of  Materials 

$354 

$349 

($5) 

-1.37% 

Transportation 

$65 

$61 

($4) 

-6.76% 

Laundry 

$145 

$145 

$0 

0.00% 

Food  Service 

$788 

$788 

$0 

0.00% 

Personnel  Support 

$24 

$23 

($2) 

-7.61% 

Ammunition 

$389 

$389 

($0) 

-0.01% 

Utilities 

$59 

$57 

($2) 

-3.17% 

Table  11.  Comparison  of  Annual  Costs  for  Scenario  1 1 
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